2012
DOI: 10.5740/jaoacint.sge_goodarzi
|View full text |Cite
|
Sign up to set email alerts
|

Feature Selection Methods in QSAR Studies

Abstract: A quantitative structure-activity relationship (QSAR) relates quantitative chemical structure attributes (molecular descriptors) to a biological activity. QSAR studies have now become attractive in drug discovery and development because their application can save substantial time and human resources. Several parameters are important in the prediction ability of a QSAR model. On the one hand, different statistical methods may be applied to check the linear or nonlinear behavior of a data set. On the other hand,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
54
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
7
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 125 publications
(54 citation statements)
references
References 121 publications
0
54
0
Order By: Relevance
“…By removing those irrelevant features, the prediction performance and robustness of the calibration models can be improved, so reducing the dimension of a multivariate data set is one of the most important steps in data handling. Variable-selection and reduction techniques are used to simplify the interpretation of models with few variables, to improve the prediction performance, and to decrease the risk of overfitting and overtraining [122].…”
Section: Variable Selectionmentioning
confidence: 99%
“…By removing those irrelevant features, the prediction performance and robustness of the calibration models can be improved, so reducing the dimension of a multivariate data set is one of the most important steps in data handling. Variable-selection and reduction techniques are used to simplify the interpretation of models with few variables, to improve the prediction performance, and to decrease the risk of overfitting and overtraining [122].…”
Section: Variable Selectionmentioning
confidence: 99%
“…The selection of descriptors plays an important role in construction for the actual model. In this work, mRMR-BFS method (minimum redundancy maximum relevance-backward feature selection) [42, 43] was used for the selection of molecular descriptors. The support vector regression (SVR) model was established based on the feature selection results.…”
Section: Methodsmentioning
confidence: 99%
“…Descriptor selection is an important step for several reasons [10], including: (i) using only a few descriptors increases the interpretability and understanding of resulting models; (ii) It can reduce the risk of overfitting from noisy redundant molecular descriptors; (iii) it can provide faster and cost-effective models; and (4) it removes the Q5 activity cliff. However, noisy, redundant, or irrelevant descriptors should be removed in a way that the dimension of the input space is reduced without any loss of significant information [3].…”
Section: Introductionmentioning
confidence: 99%