2003
DOI: 10.1016/s0021-9673(03)00004-9
|View full text |Cite
|
Sign up to set email alerts
|

Classification and regression tree analysis for molecular descriptor selection and retention prediction in chromatographic quantitative structure–retention relationship studies

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

1
43
0

Year Published

2005
2005
2021
2021

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 80 publications
(44 citation statements)
references
References 27 publications
1
43
0
Order By: Relevance
“…This maximal tree will usually contain too many leaves and will overfit the learning data set, which will cause poor predictive abilities for new sample. 10 Therefore, the selection of an optimal tree with a good compromise between model fit and predictive properties is required. Thus, in general, CART analysis consists of three steps: (i) the maximal-tree building, (ii) the tree "pruning", which consists of the cutting-off of nodes to generate a sequence of simpler trees, and (iii) the optimal tree selection.…”
Section: Theorymentioning
confidence: 99%
See 3 more Smart Citations
“…This maximal tree will usually contain too many leaves and will overfit the learning data set, which will cause poor predictive abilities for new sample. 10 Therefore, the selection of an optimal tree with a good compromise between model fit and predictive properties is required. Thus, in general, CART analysis consists of three steps: (i) the maximal-tree building, (ii) the tree "pruning", which consists of the cutting-off of nodes to generate a sequence of simpler trees, and (iii) the optimal tree selection.…”
Section: Theorymentioning
confidence: 99%
“…10 CART is extensively used for modeling and classification in several areas, such as medical diagnosis and prognosis, [11][12][13] ecology, 14 agriculture 15 and chemistry. 10,[16][17] A very interesting advantage of CART is the possibility to deal with large numbers of both categorical and numerical variables. Another advantage is that no assumption about the underlying distribution of the predictor variables is required (even categorical variables can be used).…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…CART itself was already used successfully in quantitative structure-retention relationships by Put et al [12] and in quantitative structure-enantioselectivity relationships by Caetano et al [13]. In a QSAR-context, our group [14] and Bai et al [15] showed that CART could be used to predict the gastrointestinal absorption of drugs.…”
Section: Introductionmentioning
confidence: 97%