2011
DOI: 10.1093/nar/gkr777
|View full text |Cite
|
Sign up to set email alerts
|

ChEMBL: a large-scale bioactivity database for drug discovery

Abstract: ChEMBL is an Open Data database containing binding, functional and ADMET information for a large number of drug-like bioactive compounds. These data are manually abstracted from the primary published literature on a regular basis, then further curated and standardized to maximize their quality and utility across a wide range of chemical biology and drug-discovery research problems. Currently, the database contains 5.4 million bioactivity measurements for more than 1 million compounds and 5200 protein targets. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

6
2,963
1
12

Year Published

2013
2013
2021
2021

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 3,493 publications
(3,103 citation statements)
references
References 33 publications
6
2,963
1
12
Order By: Relevance
“…More than 400 million coordinate sets were downloaded in 2013 from the wwPDB partner sites. Both the utility and the uniformity of PDB data have enabled the development of other databases and datarelated resources, including resources for drug discovery (for a review see [32]); resources focused on small molecules and ligands such as ChEMBL [33], DrugBank [34], BindingDB [35], BindingMOAD [36], and PDBBind [37]; protein structure classification and annotation resources, such as CATH [38,39], SCOP [40][41][42], and PDBsum [43,44]; and focused, specialty annotation resources such as Protein Data Bank of Transmembrane Proteins (PDBTM) [45], ArchDB for functional loops in structures [46], and 3did for protein-protein interaction surfaces [47]. These resources are frequently compiled in the annual Database Issue of Nucleic Acids Research.…”
Section: Current Capabilities and Usagementioning
confidence: 99%
“…More than 400 million coordinate sets were downloaded in 2013 from the wwPDB partner sites. Both the utility and the uniformity of PDB data have enabled the development of other databases and datarelated resources, including resources for drug discovery (for a review see [32]); resources focused on small molecules and ligands such as ChEMBL [33], DrugBank [34], BindingDB [35], BindingMOAD [36], and PDBBind [37]; protein structure classification and annotation resources, such as CATH [38,39], SCOP [40][41][42], and PDBsum [43,44]; and focused, specialty annotation resources such as Protein Data Bank of Transmembrane Proteins (PDBTM) [45], ArchDB for functional loops in structures [46], and 3did for protein-protein interaction surfaces [47]. These resources are frequently compiled in the annual Database Issue of Nucleic Acids Research.…”
Section: Current Capabilities and Usagementioning
confidence: 99%
“…For huge datasets, such as ChEMBL (Gaulton et al, 2012), BindingDB (Liu et al, 2007) and ChemBank (Seiler et al, 2008), the molecules need to be clustered by similarity to reduce the size of the task. An interactive display of millions of leafs is not possible with current technology.…”
Section: Clustering Of Molecules For Very Large Datasetsmentioning
confidence: 99%
“…A molecules' ability to inhibit protein-protein interactions may also be predicted (Kuenemann et al, 2015). In order to discover such interactions, researchers must harness large databases, including PubChem (Wang et al, 2012) and ChEMBL (Gaulton et al, 2012), which contain vast amounts of heterogeneous biological and chemical data. The size and complexity of these datasets necessitate automated tools to explore available chemical space in order to determine chemical relationships and predict potential interactions.…”
Section: Introductionmentioning
confidence: 99%
“…In a recent analysis 14 , 18,208 activity cliffs were extracted from more than 41,000 unique ChEMBL 15 (release 15) compounds with activity against the current spectrum of human targets. Then, differences in LE between lowly and highly potent cliff partners were systematically assessed.…”
Section: Ligand Efficiency and Lipophilic Efficiencymentioning
confidence: 99%