2005
DOI: 10.1021/ci0500381
|View full text |Cite
|
Sign up to set email alerts
|

A Stepwise Approach for Defining the Applicability Domain of SAR and QSAR Models

Abstract: A stepwise approach for determining the model applicability domain is proposed. Four stages are applied to account for the diversity and complexity of the current SAR/QSAR models, reflecting their mechanistic rationality (including metabolic activation of chemicals) and transparency. General parametric requirements are imposed in the first stage, specifying in the domain only those chemicals that fall in the range of variation of the physicochemical properties of the chemicals in the training set. The second s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
181
0
10

Year Published

2007
2007
2014
2014

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 259 publications
(192 citation statements)
references
References 19 publications
1
181
0
10
Order By: Relevance
“…Skew within the descriptor space has led [19] to suggest that, in addition to a "general requirements" domain (which includes, but is not limited to, molecular properties) and a structural domain, a "mechanistic domain" is also required. This study indicates that an understanding of the mechanistic domain is the most critical criterion to understand the accuracy of, and hence the confidence that may be placed in, predictions.…”
Section: Discussionmentioning
confidence: 99%
“…Skew within the descriptor space has led [19] to suggest that, in addition to a "general requirements" domain (which includes, but is not limited to, molecular properties) and a structural domain, a "mechanistic domain" is also required. This study indicates that an understanding of the mechanistic domain is the most critical criterion to understand the accuracy of, and hence the confidence that may be placed in, predictions.…”
Section: Discussionmentioning
confidence: 99%
“…In this scheme, each non-hydrogen atom in a mole- [14,17], the first-order scheme with the simple atom-type discrimination as described has proven useful for the present purpose. The Dice similarity [18] has been used as the similarity measure between two compounds A and B, Sim(A, B), building on the ACF characterization of the chemical structures:…”
Section: Acf-based K Nearest Neighbor Modelmentioning
confidence: 99%
“…The problem can be solved using the Data Description Analysis based on 1-SVM technique. Unlike some applicability domains techniques earlier described in the literature, this approach: (i) is not confined to any model-specific information (2) can handle large number of descriptors; (3) involves low (zero) variance descriptors; (4) can even be used in the framework of descriptor-less approaches based on chemical graph kernels. An application of 1-SVM improves performance of regression models by rejecting compounds (outliers) dissimilar to the training set.…”
Section: Discussionmentioning
confidence: 99%
“…The problem of defining the applicability domain (AD) of a QSAR/QSPR model is one of the hottest topics in chemoinformatics (see reviews [1][2][3] ). Surprisingly that this concept is currently used almost exclusively in chemoinformatics and still very little has been discussed in mathematical statistics and machine learning theory.…”
Section: Introductionmentioning
confidence: 99%