Exposing the limitations of molecular machine learning with activity cliffs.

Tilborg, Derek van; Alenicheva, Alisa; Grisoni, Francesca

doi:10.26434/chemrxiv-2022-mfq52-v3

Cited by 9 publications

(13 citation statements)

References 71 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…TDC refers to 12 data sets used as regression benchmarks and provided by the Therapeutic Data Commons. 47 ChEMBL refers to 30 data sets curated from the ChEMBL database 62 by van Tilborg et al 11 The relative RMSE is the average, normalized RMSE obtained from 5-fold cross validation. Details of the fingerprints and descriptors used as molecular representations can be found in Methods.…”

Section: ■ Resultsmentioning

confidence: 99%

“…Three sets of regression tasks were used in this work. Structure−property landscapes related to regression tasks were retrieved from the Therapeutic Data Commons (TDC) 47 using the Python library PyTDC (v. 0.3.6) and from the previous work of van Tilborg et al 11 A total of 55 regression data sets, split across three groups, were considered.…”

Section: ■ Introductionmentioning

confidence: 99%

“…49 (2) The second group of data sets, referred to as "TDC", comprised 12 data sets that featured pharmaco-kinetic and toxicological properties and were obtained from the TDC: Caco2_Wang, 52 Lipophilicity_AstraZeneca, 53,54 Solubili-ty_AqSolDB, 5 5 HydrationFreeEnergy_FreeSolv, 5 6 PPBR_AZ, 54 VDss_Lombardo, 57 Half_Life_Obach, 58 Clear-ance_Hepatocyte_AZ, 54,59 Clearance_Microsome_AZ, 54,59 LD50_Zhu, 60 herg_central/hERG_at_1uM, 61 and herg_central/hERG_at_10uM. 61 (3) The third group of data sets, referred to as "ChEMBL", comprised 30 SAR data sets from ChEMBL 62 that were curated by van Tilborg et al 11 To reduce the computational cost of performing these tests, data set sizes were capped at 10,000 molecules; data sets containing a larger number of entries were subsampled at random (using a fixed seed for reproducibility).…”

Section: ■ Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Roughness of Molecular Property Landscapes and Its Impact on Modellability

Aldeghi

Graff

Frey

et al. 2022

J. Chem. Inf. Model.

View full text Add to dashboard Cite

In molecular discovery and drug design, structure–property relationships and activity landscapes are often qualitatively or quantitatively analyzed to guide the navigation of chemical space. The roughness (or smoothness) of these molecular property landscapes is one of their most studied geometric attributes, as it can characterize the presence of activity cliffs, with rougher landscapes generally expected to pose tougher optimization challenges. Here, we introduce a general, quantitative measure for describing the roughness of molecular property landscapes. The proposed roughness index (ROGI) is loosely inspired by the concept of fractal dimension and strongly correlates with the out-of-sample error achieved by machine learning models on numerous regression tasks.

show abstract

Section: ■ Resultsmentioning

confidence: 99%

Section: ■ Introductionmentioning

confidence: 99%

Section: ■ Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Roughness of Molecular Property Landscapes and Its Impact on Modellability

Aldeghi

Graff

Frey

et al. 2022

J. Chem. Inf. Model.

View full text Add to dashboard Cite

show abstract

“…8,9 ACs were rst accurately predicted using support vector machine (SVM) modeling on the basis of special kernel functions enabling compound pair predictions. 9 These ndings have also catalyzed further AC predictions using SVR variants [10][11][12] and other methods, [13][14][15][16][17][18] as discussed below. Recently, various deep neural network architectures have been used to predict ACs from images 14,15 and molecular graphs using representation learning 16 or derive regression models for potency prediction of AC compounds.…”

Section: Introductionmentioning

confidence: 99%

“…Recently, various deep neural network architectures have been used to predict ACs from images 14,15 and molecular graphs using representation learning 16 or derive regression models for potency prediction of AC compounds. 17,18 In this work, we further extend this methodological spectrum by introducing chemical language models for combined AC prediction and generative compound design. Compared to earlier studies predicting ACs using classication models, the approach presented herein was designed to extend AC predictions with the capacity to produce new AC compounds, thus integrating predictive and generative modeling in the context of AC analysis and AC-based compound design.…”

Section: Introductionmentioning

confidence: 99%

DeepAC – conditional transformer-based chemical language model for the prediction of activity cliffs formed by bioactive compounds

2022

View full text Add to dashboard Cite

show abstract

Large-scale prediction of activity cliffs using machine and deep learning methods of increasing complexity

2023

View full text Add to dashboard Cite

Activity cliffs (AC) are formed by pairs of structural analogues that are active against the same target but have a large difference in potency. While much of our knowledge about ACs has originated from the analysis and comparison of compounds and activity data, several studies have reported AC predictions over the past decade. Different from typical compound classification tasks, AC predictions must be carried out at the level of compound pairs representing ACs or nonACs. Most AC predictions reported so far have focused on individual methods or comparisons of two or three approaches and only investigated a few compound activity classes (from 2 to 10). Although promising prediction accuracy has been reported in most cases, different system set-ups, AC definitions, methods, and calculation conditions were used, precluding direct comparisons of these studies. Therefore, we have carried out a large-scale AC prediction campaign across 100 activity classes comparing machine learning methods of greatly varying complexity, ranging from pair-based nearest neighbor classifiers and decision tree or kernel methods to deep neural networks. The results of our systematic predictions revealed the level of accuracy that can be expected for AC predictions across many different compound classes. In addition, prediction accuracy did not scale with methodological complexity but was significantly influenced by memorization of compounds shared by different ACs or nonACs. In many instances, limited training data were sufficient for building accurate models using different methods and there was no detectable advantage of deep learning over simpler approaches for AC prediction. On a global scale, support vector machine models performed best, by only small margins compared to others including simple nearest neighbor classifiers. Graphical Abstract

show abstract

Exposing the limitations of molecular machine learning with activity cliffs.

Cited by 9 publications

References 71 publications

Roughness of Molecular Property Landscapes and Its Impact on Modellability

Roughness of Molecular Property Landscapes and Its Impact on Modellability

DeepAC – conditional transformer-based chemical language model for the prediction of activity cliffs formed by bioactive compounds

Large-scale prediction of activity cliffs using machine and deep learning methods of increasing complexity

Contact Info

Product

Resources

About