2004
DOI: 10.1021/ci049766z
|View full text |Cite
|
Sign up to set email alerts
|

Toward an Optimal Procedure for PC-ANN Model Building:  Prediction of the Carcinogenic Activity of a Large Set of Drugs

Abstract: The performances of the three novel QSAR algorithms, principal component-artificial neural network modeling method combining with three factor selection procedures named eigenvalue ranking, correlation ranking, and genetic algorithm (ER-PC-ANN, CR-PC-ANN, PC-GA-ANN, respectively), are compared by application of these model to the prediction of the carcinogenic activity of a large set of drugs (735 drugs) belonging to a diverse type of compounds. A total number of 1350 theoretical descriptors are calculated for… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
39
0

Year Published

2005
2005
2018
2018

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 64 publications
(39 citation statements)
references
References 54 publications
0
39
0
Order By: Relevance
“…21 Hence, selecting the significant and informative PC's is the main problem in all of the PCA-based calibration methods. [22][23][24][25] Different methods have been addressed to select the significant PC's for calibration purposes. The simplest and most common one is a top-down variable selection where the PC's are ranked in the order of decreasing eigenvalues and the PC's with highest eigenvalue is considered as the most significant one and, subsequently, the PC's are introduced into the calibration model.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…21 Hence, selecting the significant and informative PC's is the main problem in all of the PCA-based calibration methods. [22][23][24][25] Different methods have been addressed to select the significant PC's for calibration purposes. The simplest and most common one is a top-down variable selection where the PC's are ranked in the order of decreasing eigenvalues and the PC's with highest eigenvalue is considered as the most significant one and, subsequently, the PC's are introduced into the calibration model.…”
Section: Introductionmentioning
confidence: 99%
“…However, the magnitude of an eigenvalue is not necessarily a measure of its significance for the calibration. 25 In the other method, which is called correlation ranking, the PC's are ranked by their correlation coefficient with the property and selected by the procedure discussed for eigenvalue ranking. 22,23 Better results are often achieved by this method.…”
Section: Introductionmentioning
confidence: 99%
“…The theoretical descriptors were reduced by the following methods: 1) descriptors that are constant or nearly constant have been eliminated, because these descriptors can not define the variation of the property with structure; 2) in order to decrease the redundancy existing in the descriptors data matrix, the correlation coefficients for all pairs of remaining descriptors were determined. If a correlation coefficient was higher than 0.91, the descriptor with lower correlation with the heat capacity was eliminated; 44,45 3) the method of stepwise multi-parameter linear regression was used to select the most important descriptors and to calculate the coefficients relating the heat capacity to the descriptors. 15 The MLR models were generated using spss/pc software package release 10.0.…”
Section: Methods and Proceduresmentioning
confidence: 99%
“…In the case of each MLR model, a feed-forward neural network with back-propagation of error algorithm was constructed to model the activity-structure relationships between the descriptors on one hand and inhibitory activity on the other hand. The model development in ANN and the network architecture is fully described by us [13] and others [14]. The data set was divided into training and external test sets.…”
Section: Principal Component-artificial Neural Network (Pc-ann) Analysismentioning
confidence: 99%
“…Both linear and nonlinear mapping functions can be modeled by configuring the network properly. To obtain powerful and accurate ANN models, one should train a subset of descriptors instead of all generated descriptors [10][11][12][13][14][15].…”
Section: Introductionmentioning
confidence: 99%