2014
DOI: 10.1021/ci4006595
|View full text |Cite
|
Sign up to set email alerts
|

Assessment of Machine Learning Reliability Methods for Quantifying the Applicability Domain of QSAR Regression Models

Abstract: The vastness of chemical space and the relatively small coverage by experimental data recording molecular properties require us to identify subspaces, or domains, for which we can confidently apply QSAR models. The prediction of QSAR models in these domains is reliable, and potential subsequent investigations of such compounds would find that the predictions closely match the experimental values. Standard approaches in QSAR assume that predictions are more reliable for compounds that are "similar" to those in … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
57
0
2

Year Published

2015
2015
2024
2024

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 59 publications
(59 citation statements)
references
References 22 publications
0
57
0
2
Order By: Relevance
“…With this purpose, Das et al 51 introduced the concept of applicability domain in the prediction of binding affinity in docking, although it is commonly used in other research areas such as QSAR. [99][100][101] Zilian and Sotriffer 63 also noted that larger overlaps between training and test sets explained the target-dependent performance of SFCscore RF . Being able to reliably predict how good binding affinity predictions are a priori would be a major advance in docking.…”
Section: Conclusion and Future Prospectsmentioning
confidence: 96%
See 1 more Smart Citation
“…With this purpose, Das et al 51 introduced the concept of applicability domain in the prediction of binding affinity in docking, although it is commonly used in other research areas such as QSAR. [99][100][101] Zilian and Sotriffer 63 also noted that larger overlaps between training and test sets explained the target-dependent performance of SFCscore RF . Being able to reliably predict how good binding affinity predictions are a priori would be a major advance in docking.…”
Section: Conclusion and Future Prospectsmentioning
confidence: 96%
“…Some authors have explained the performance of a SF a posteriori . With this purpose, Das et al introduced the concept of applicability domain in the prediction of binding affinity in docking, although it is commonly used in other research areas such as QSAR . Zilian and Sotriffer also noted that larger overlaps between training and test sets explained the target‐dependent performance of SFCscore RF .…”
Section: Conclusion and Future Prospectsmentioning
confidence: 99%
“…Similarly, a model will not accurately predict activity values outside the range covered in the training data (e.g., predict that a compound is active using a model trained on only inactive compounds). Hence the interest in developing reliable techniques to estimate errors in prediction for individual instances 38,45,46 .…”
Section: Introductionmentioning
confidence: 99%
“…Recently, another type of AD measurement was proposed by building an additional error model to assess the prediction reliability [9][10] . Benchmark studies 2,[11][12] on various AD metrics have previously been performed. Toplak et al 12 showed that methods of reliability indices were sensitive to dataset characteristics and to the regression method used in building the QSAR model.…”
Section: Introductionmentioning
confidence: 99%
“…Benchmark studies 2,[11][12] on various AD metrics have previously been performed. Toplak et al 12 showed that methods of reliability indices were sensitive to dataset characteristics and to the regression method used in building the QSAR model. Most of these AD metrics lack a rigorous scientific derivation.…”
Section: Introductionmentioning
confidence: 99%