2009
DOI: 10.1021/ci9000162
|View full text |Cite
|
Sign up to set email alerts
|

Prediction of HPLC Retention Index Using Artificial Neural Networks and IGroup E-State Indices

Abstract: A back-propagation artificial neural network (ANN) was used to create a 10-fold leave-10%-out cross-validated ensemble model of high performance liquid chromatography retention index (HPLC-RI) for a data set of 498 diverse druglike compounds. A 10-fold multiple linear regression (MLR) ensemble model of the same data was developed for comparison. Molecular structure was described using IGroup E-state indices, a novel set of structure-information representation (SIR) descriptors, along with molecular connectivit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

1
35
0

Year Published

2009
2009
2018
2018

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 29 publications
(36 citation statements)
references
References 31 publications
1
35
0
Order By: Relevance
“…A model with a higher correlation coefficient and lower standard error will likely be produced with a larger dataset. The development of a predictive model using our methods was demonstrated previously in a model developed for predicting chromatographic retention index values [39]. That model was based on a much larger dataset (n ϭ 498, 394 in training set), permitting the use of a greater number of structure descriptors and also allowing the use of a nonlinear artificial neural network analysis.…”
Section: Experimental Sy ϭ Intensitypmentioning
confidence: 98%
See 1 more Smart Citation
“…A model with a higher correlation coefficient and lower standard error will likely be produced with a larger dataset. The development of a predictive model using our methods was demonstrated previously in a model developed for predicting chromatographic retention index values [39]. That model was based on a much larger dataset (n ϭ 498, 394 in training set), permitting the use of a greater number of structure descriptors and also allowing the use of a nonlinear artificial neural network analysis.…”
Section: Experimental Sy ϭ Intensitypmentioning
confidence: 98%
“…Predictive models have been developed using these methods for a range of data types [41][42][43][44][45][46][47]. Thus, although the present model is not useful for predictive purposes, its development clearly suggests that modeling of a larger dataset could be predictive [23,28,39,[41][42][43][44]. Furthermore, the anticipated improved model may also assist in identifying further structural features important in determining precursor ion stability and fragmentation mechanisms [23,28,[42][43][44][45][46].…”
Section: Experimental Sy ϭ Intensitypmentioning
confidence: 99%
“…The IGroups methodology [7] explicitly represents every atom in a molecule by grouping atom level contributions from atoms that undergo similar noncovalent interactions in solution. Atom level contributions are quantified by the Atom-level Electrotopological State (E-State) [19].…”
Section: Methods: Model Datamentioning
confidence: 99%
“…MolFind/BioSM applies five additional orthogonal filters to reduce the number of false positives returned from the exact mass search [26]. Filters are: HPLC retention index (RI) [3, 67], Ecom 50 [2,3] drift index (DI) [5], biological/nonbiological (BioSM) [8] and CID (MS/MS) spectra [5, 9]. In filtering, predicted values for RI, Ecom 50 and DI are made for candidate compounds using computational models.…”
mentioning
confidence: 99%
See 1 more Smart Citation