2012
DOI: 10.3390/molecules17054791
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of Different Approaches to Define the Applicability Domain of QSAR Models

Abstract: Abstract:One of the OECD principles for model validation requires defining the Applicability Domain (AD) for the QSAR models. This is important since the reliable predictions are generally limited to query chemicals structurally similar to the training compounds used to build the model. Therefore, characterization of interpolation space is significant in defining the AD and in this study some existing descriptor-based approaches performing this task are discussed and compared by implementing them on existing v… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
396
0
4

Year Published

2013
2013
2022
2022

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 457 publications
(403 citation statements)
references
References 23 publications
3
396
0
4
Order By: Relevance
“…This may be attributed to the alignment of a number of factors, including increased availability of data, advances in data-mining methodologies as well as a more widespread appreciation of how to avoid many of the numerous pitfalls in building and applying QSAR models (Cherkasov et al 2014). Current trends in the field include efforts in chemical data curation (Williams et al 2012), automation of QSAR model building (Cox et al 2013), exploration of alternative descriptors (Cherkasov et al 2014), and efforts to help define the Applicability Domain (AD) of a given QSAR model (Sahigara et al 2012).…”
Section: Discussionmentioning
confidence: 99%
“…This may be attributed to the alignment of a number of factors, including increased availability of data, advances in data-mining methodologies as well as a more widespread appreciation of how to avoid many of the numerous pitfalls in building and applying QSAR models (Cherkasov et al 2014). Current trends in the field include efforts in chemical data curation (Williams et al 2012), automation of QSAR model building (Cox et al 2013), exploration of alternative descriptors (Cherkasov et al 2014), and efforts to help define the Applicability Domain (AD) of a given QSAR model (Sahigara et al 2012).…”
Section: Discussionmentioning
confidence: 99%
“…geometry, range, distance or probability density function based approaches) proposing to define the applicability domain region of statistical models based on different algorithms. For more detailed information about the available approaches for defining the (Q)SAR model applicability domain, interested readers are encouraged to refer to the review papers by others (Jaworska, Aldenberg, & Nikolova, 2005;Sahigara, et al, 2012).…”
Section: Support Vector Machines (Svm)mentioning
confidence: 99%
“…To obtain predictions for any incoming sample set using the model, the first group of methods are used to ensure that the compounds are within the so-called 'active subspace': which essentially means that we are actually performing interpolation only, not extrapolation. [52,53] For the distancebased approach, a pre-defined statistic is calculated to quantify the degree of nearness of the test compounds to the training set and based on whether that statistic is above or below a certain cutoff value, predictions for those compounds are considered acceptable or not. [52,54] …”
Section: Applicability Domain Of Qsar Modelsmentioning
confidence: 99%
“…[52,53] For the distancebased approach, a pre-defined statistic is calculated to quantify the degree of nearness of the test compounds to the training set and based on whether that statistic is above or below a certain cutoff value, predictions for those compounds are considered acceptable or not. [52,54] …”
Section: Applicability Domain Of Qsar Modelsmentioning
confidence: 99%