Most Ligand-Based Classification Benchmarks Reward Memorization Rather than Generalization

Wallach, Izhar; Heifets, Abraham

doi:10.1021/acs.jcim.7b00403

Cited by 201 publications

(317 citation statements)

References 39 publications

(101 reference statements)

Mentioning: 296

“…The DEKOIS project (currently at version 2.0) [71,72] is intended to provide a "demanding" […] Among the 81 proteins in the DEKOIS set, we noted that some were included in our training set as well. To avoid any potential information leakage that might overestimate the performance we could expect in future applications [33], we completely removed these test cases. This left a set of 23 protein targets, each of which vScreenML had never seen before.…”

Section: Benchmarking vScreenML Using Independent Test Sets (mentioning)

confidence: 99%
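
The leakage check described in the statement above can be illustrated with a minimal sketch. It assumes targets are compared by a case-insensitive identifier; the statement does not say how overlap was actually detected, and the function name and target identifiers below are hypothetical.

```python
def drop_overlapping_targets(benchmark_targets, training_targets):
    """Return the benchmark targets that do not occur in the training set.

    Assumption: two targets are "the same" when their identifiers match
    after trimming whitespace and ignoring case.
    """
    seen = {t.strip().upper() for t in training_targets}
    return [t for t in benchmark_targets if t.strip().upper() not in seen]


# Hypothetical identifiers, for illustration only.
training = ["ACHE", "CDK2", "EGFR"]
benchmark = ["cdk2", "HDAC8", "EGFR", "PPARG"]

held_out = drop_overlapping_targets(benchmark, training)
print(held_out)
```

Matching on identifiers alone is the weakest form of this check; close homologs with different names would still pass through, so a stricter filter (for example on sequence similarity) may be needed in practice.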

“…For these reasons, machine learning techniques may be especially well-suited for developing scoring functions that will provide a dramatic improvement in the ability to identify active compounds without human expert intervention. However, while machine learning may offer the potential to improve on the high false positive rate of current scoring function, further analysis has revealed that many methods to date reporting promising results in artificial benchmark experiments may have inadvertently overfit models to the training data [33]: this can be a subtle effect of information leakage, occurring when the validation/testing data are not truly non-redundant from the training data.…”

Section: Introduction (mentioning)

confidence: 99%


Machine learning classification can reduce false positives in structure-based virtual screening

Adeshina; Deeds; Karanicolas (2020). Preprint


With the recent explosion in the size of libraries available for screening, virtual screening is positioned to assume a more prominent role in early drug discovery's search for active chemical matter. Modern virtual screening methods are still, however, plagued with high false positive rates: typically, only about 12% of the top-scoring compounds actually show activity when tested in biochemical assays. We argue that most scoring functions used for this task have been developed with insufficient thoughtfulness into the datasets on which they are trained and tested, leading to overly simplistic models and/or overtraining. These problems are compounded in the literature because none of the studies reporting new scoring methods have validated their model prospectively within the same study. Here, we report a new strategy for building a training dataset (D-COID) that aims to generate highly-compelling decoy complexes that are individually matched to available active complexes. Using this dataset, we train a general-purpose classifier for virtual screening (vScreenML) that is built on the XGBoost framework of gradient-boosted decision trees. In retrospective benchmarks, our new classifier shows outstanding performance relative to other scoring functions. We additionally evaluate the classifier in a prospective context, by screening for new acetylcholinesterase inhibitors. Remarkably, we find that nearly all compounds selected by vScreenML show detectable activity at 50 µM, with 10 of 23 providing greater than 50% inhibition at this concentration. Without any medicinal chemistry optimization, the most potent hit from this initial screen has an IC50 of 280 nM, corresponding to a Ki value of 173 nM. These results support using the D-COID strategy for training classifiers in other computational biology tasks, and for vScreenML in virtual screening campaigns against other protein targets. Both D-COID and vScreenML are freely distributed to facilitate such efforts.


“…This version of the CNN [16] was the subject of all the analyses so far carried out into the importance of the receptor in active/decoy classification. [1][2][3] DenseFS: A much deeper network with three sets of four densely connected convolutional layers followed by a fully-connected softmax layer and cross entropy loss. [18] This network significantly improved performance over the Gnina network on both held-out DUD-E targets and the ChEMBL set.…”

Section: CNN Architectures (mentioning)

confidence: 99%

“…A series of recent papers has shown that some deep learning methods designed for structure-based virtual screening can accurately separate actives and decoys when given only the structure of the ligand. [1][2][3] These results indicate that such methods are learning differences between the properties of actives and decoys, rather than the physical interactions between the receptor and the ligand. From this it is possible to conclude both that the methods will fail to generalize well (predict on datasets far removed from the training data), and that there are significant flaws in the current training datasets and/or regimens.…”

Section: Introduction (mentioning)

confidence: 97%

“…It was recently shown that model performance for some methods does not suffer significantly when ligand structures in the test set are given without the protein target/receptor. [2,3] The overall conclusion from these studies was that the methods are using very little if any information from the protein target. This finding links to the fact that in many of these datasets, receptor-free methods such as k-nearest neighbours on ligand fingerprints perform almost as well as methods making use of the receptor.…”

Section: Introduction (mentioning)

confidence: 99%
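
As a rough illustration of the receptor-free baseline mentioned in the statement above, the sketch below classifies a ligand by majority vote among its k nearest training ligands under Tanimoto similarity. It assumes fingerprints are represented as Python sets of on-bit indices; the fingerprints and labels are toy values invented for this example, not data from any of the cited studies.

```python
def tanimoto(a, b):
    """Tanimoto (Jaccard) similarity between two sets of on-bit indices."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def knn_predict(query, train, k=3):
    """Majority vote over the k training ligands most similar to `query`.

    `train` is a list of (fingerprint_set, label) pairs, with label 1 for
    an active and 0 for a decoy. No protein information is used at all.
    """
    ranked = sorted(train, key=lambda item: tanimoto(query, item[0]), reverse=True)
    top = ranked[:k]
    votes = sum(label for _, label in top)
    return 1 if votes * 2 > len(top) else 0


# Toy fingerprints, invented for illustration.
train = [
    ({1, 2, 3, 4}, 1),
    ({1, 2, 3, 5}, 1),
    ({2, 3, 4, 6}, 1),
    ({10, 11, 12}, 0),
    ({10, 11, 13}, 0),
    ({11, 12, 14}, 0),
]

pred_active = knn_predict({1, 2, 3, 6}, train)
pred_decoy = knn_predict({10, 12, 14}, train)
print(pred_active, pred_decoy)
```

When test ligands closely resemble training ligands of the same class, a baseline of this kind can score well without any model of binding, which is the concern the statement raises.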


Dataset Augmentation Allows Deep Learning-Based Virtual Screening To Better Generalize To Unseen Target Classes, And Highlight Important Binding Interactions

Scantlebury; Brown; Delft; et al. (2020). Preprint


Current deep learning methods for structure-based virtual screening take the structures of both the protein and the ligand as input but make little or no use of the protein structure when predicting ligand binding. Here we show how a relatively simple method of dataset augmentation forces such deep learning methods to take into account information from the protein. Models trained in this way are more generalisable (make better predictions on protein-ligand complexes from a different distribution to the training data). They also assign more meaningful importance to the protein and ligand atoms involved in binding. Overall, our results show that dataset augmentation can help deep learning based virtual screening to learn physical interactions rather than dataset biases.


Artificial Intelligence in Medicinal Chemistry

Griffen; Dossetter; Leach; et al. (2021). Burger's Medicinal Chemistry and Drug Discovery


Basic concepts in artificial intelligence (AI) are reviewed with reference to their application in medicinal chemistry. Challenges in the representation of molecules and biological data are discussed. The contrast between AI requiring autonomy and machine learning as a set of statistical techniques is highlighted. Key areas of AI in medicinal chemistry are then reviewed: data extraction, computer-aided synthesis planning, compound activity, toxicity and ADME property prediction, automated compound design, and integrating all these tasks into automated drug discovery systems. Finally, issues of the adoption of AI methods by medicinal chemists, future developments, and the ethics of AI in medicinal chemistry are addressed.
