2003
DOI: 10.1021/ci0340916
|View full text |Cite
|
Sign up to set email alerts
|

Drug Discovery Using Support Vector Machines. The Case Studies of Drug-likeness, Agrochemical-likeness, and Enzyme Inhibition Predictions

Abstract: Support Vector Machines (SVM) is a powerful classification and regression tool that is becoming increasingly popular in various machine learning applications. We tested the ability of SVM, in comparison with well-known neural network techniques, to predict drug-likeness and agrochemical-likeness for large compound collections. For both kinds of data, SVM outperforms various neural networks using the same set of descriptors. We also used SVM for estimating the activity of Carbonic Anhydrase II (CA II) enzyme in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
114
0
1

Year Published

2004
2004
2010
2010

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 193 publications
(116 citation statements)
references
References 20 publications
1
114
0
1
Order By: Relevance
“…We calculated the EF′ value for the retrieval of the top-ranked 18 of the 25 known actives (72% of actives) in a given test set, close to the 70% chosen by Halgren et (11) al. 32 The advantage of this EF′ statistic is that if two different sets of molecules' SVM activity scores give the same ranking for the 18th active, the modified enrichment factor will yield a higher value for the set of scores that rank the other 17 actives higher in one list than in the other; the conventional enrichment factor (eq 10) would not make this distinction.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We calculated the EF′ value for the retrieval of the top-ranked 18 of the 25 known actives (72% of actives) in a given test set, close to the 70% chosen by Halgren et (11) al. 32 The advantage of this EF′ statistic is that if two different sets of molecules' SVM activity scores give the same ranking for the 18th active, the modified enrichment factor will yield a higher value for the set of scores that rank the other 17 actives higher in one list than in the other; the conventional enrichment factor (eq 10) would not make this distinction.…”
Section: Methodsmentioning
confidence: 99%
“…Several studies have shown the SVM to be among the best methods for correctly classifying molecules. [8][9][10][11] A standard application of the SVM algorithm involves defining two classes of objects, determining a set of numbers that characterize each object, and using the SVM algorithm to calculate a classification model for the objects. After this training step, the SVM model is used to classify other objects.…”
Section: Introductionmentioning
confidence: 99%
“…[1][2][3][4][5][6][7] These descriptors were initially developed for the construction of quantitative structure-activity relationship (QSAR) and quantitative structure-property relationship (QSPR) of structurally related compounds. 8 They have been extensively used for the statistical-learning-based prediction of pharmacodynamic, pharmacokinetic, and toxicological properties of chemical agents including drug-likeness, [9][10][11] blood-brain barrier penetration, 12,13 human intestinal absorption, 4 drug-receptor binding, [14][15][16] drug metabolism, 17 cellular membrane partitioning, 18 chemical space navigation, 19 and antibacterial activity. 20,21 Some of these molecular descriptors are developed for the study of a particular type of properties of a group of structurally related chemical agents.…”
Section: Introductionmentioning
confidence: 99%
“…Similar trends between databases containing drugs, i.e., the MACCS-I Drug Data Report (MDDR) as well as the Comprehensive Medicinal Chemistry (CMC) and compound collections comprising mainly nondrugs such as the Available Chemical Directory (ACD), were reported before by Oprea and later by Zheng et al 29,51 While Figure 3a shows a clear separation of the maximum peaks of the molecular weight of nondrugs and drugs, Zernov et al yielded a stronger overlap between both distributions that intersect at around 360 that is higher than our separating margin. 19 Almost complete overlap between substances from the World Drugs Index (WDI) and those from the ACD were found by Li et al 17 Another obvious visual separation is found for the molar refractivity (see Figure 3b). The separating margin obtained by tree B is 40, identical to that obtained from tree A.…”
Section: Resultsmentioning
confidence: 89%
“…7,[11][12][13][14][15][16][17] However, even support vector machines that are known to be among the most accurate methods for classification tasks have so far not provided fully satisfactory results. [17][18][19][20] This can be, at least in part, explained by the implicit difficulties that arise from the data set.…”
Section: Introductionmentioning
confidence: 99%