DATA SOURCES
NameData typesVersionLast update
Array ExpressGene Expression Data9/1/2015
cancer.govDrug Indications7/4/2019
ChEMBLCompounds, Bioactivity DataChEMBL_2512/10/2018
clinicaltrials.govClinical Trials5/1/2019
COSMICMutations, Copy NumberV7111/4/2014
Pathway CommonsPathwaysV96/30/2017
PDB3D Crystal Structures10/22/2019
STRINGProtein to Protein Interaction DataV109/1/2015
TCGAMutations, Gene Expression Data6/15/2017
UniProtProteins2019_0910/16/2019
DATA COUNTS
Type#
Proteins
20,365 human, 561,176 all species
3D Structures
157,128 structures, 448,900 chains
Cell Lines
12,398
Compounds
2,079,339 unique structures
Organisms
2,148
Chemical Bioactivities
7,273,192 datapoints from 61,621 studies
Amplification / Copy Number
346,286 regions
Gene Expression
TCGA: 218,210,403
NCI60: 1,069,906
Interference
2,120 datapoints
Clinical Trials
254,899
Mutations
COSMIC Cell Lines Project: 1,179,585
COSMIC: 2,436,246
TCGA Cancer: 2,828,755
TCGA Metastatic: 325,253