2018
DOI: 10.1186/s13321-018-0271-1
|View full text |Cite
|
Sign up to set email alerts
|

A confidence predictor for logD using conformal regression and a support-vector machine

Abstract: Lipophilicity is a major determinant of ADMET properties and overall suitability of drug candidates. We have developed large-scale models to predict water–octanol distribution coefficient (logD) for chemical compounds, aiding drug discovery projects. Using ACD/logD data for 1.6 million compounds from the ChEMBL database, models are created and evaluated by a support-vector machine with a linear kernel using conformal prediction methodology, outputting prediction intervals at a specified confidence level. The r… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
56
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
8
2

Relationship

3
7

Authors

Journals

citations
Cited by 44 publications
(56 citation statements)
references
References 37 publications
0
56
0
Order By: Relevance
“…An advantage of using unhashed fingerprints is that the features have a particular substructure assigned to them and can therefore be traced back to the actual molecular features. Thus using unhashed fingerprints mean that each feature represents a certain molecular substructure, and by assessing feature importance, this can be helpful in interpreting model results in a chemical context [4042].…”
Section: Resultsmentioning
confidence: 99%
“…An advantage of using unhashed fingerprints is that the features have a particular substructure assigned to them and can therefore be traced back to the actual molecular features. Thus using unhashed fingerprints mean that each feature represents a certain molecular substructure, and by assessing feature importance, this can be helpful in interpreting model results in a chemical context [4042].…”
Section: Resultsmentioning
confidence: 99%
“…The descriptors characterized the physicochemical, two-dimensional substructures, and drug-like properties of the studied compounds. Lapins et al ( 2018 ) constructed a QSAR model to predict the lipophilicity of compounds by using a signature molecular descriptor, which is related to the molecular two-dimensional topology information from 1.6 million compounds. Xu et al ( 2017 ) developed three deep learning-based QSAR models to evaluate the acute oral toxicity (AOT) of compounds.…”
Section: In Silico Approachesmentioning
confidence: 99%
“…In this study we used the Mondrian ACP implementation in the software CPSign (Arvidsson, 2016 ), leveraging the LIBLINEAR SVM implementation (Fan et al, 2008 ) together with the signatures molecular descriptor (Faulon et al, 2003 ). This descriptor is based on the neighboring of atoms in a molecule and has been shown to work well for QSAR studies (Alvarsson et al, 2016 ; Lapins et al, 2018 ) and for ligand-based target prediction (Alvarsson et al, 2014 ). Signatures were generated with height 1-3, which means that molecular sub-graphs including all atoms of distance 1, 2, or 3 from initial atoms, are generated.…”
Section: Methodsmentioning
confidence: 99%