2019
DOI: 10.1186/s13321-019-0384-1
|View full text |Cite
|
Sign up to set email alerts
|

Open-source QSAR models for pKa prediction using multiple machine learning approaches

Abstract: Background The logarithmic acid dissociation constant pKa reflects the ionization of a chemical, which affects lipophilicity, solubility, protein binding, and ability to pass through the plasma membrane. Thus, pKa affects chemical absorption, distribution, metabolism, excretion, and toxicity properties. Multiple proprietary software packages exist for the prediction of pKa, but to the best of our knowledge no free and open-source programs exist for this purpose. Using a freely available data se… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

4
103
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 130 publications
(107 citation statements)
references
References 44 publications
4
103
0
Order By: Relevance
“…In most cases, 2D-and 3D-QSAR studies are commonly used to evaluate the series of chemical compounds [86][87][88]. The 1D-QSAR approach allows for the determination of correlations for 1D descriptors (pKa, log P, structural fragments, and fingerprints) with biological activity [89]. To date, the 2D-QSAR method has been widely explored in many studies for the study of toxic or medicinal chemistry, which was first proposed in the early 1970s [90].…”
Section: Qsar Assaymentioning
confidence: 99%
“…In most cases, 2D-and 3D-QSAR studies are commonly used to evaluate the series of chemical compounds [86][87][88]. The 1D-QSAR approach allows for the determination of correlations for 1D descriptors (pKa, log P, structural fragments, and fingerprints) with biological activity [89]. To date, the 2D-QSAR method has been widely explored in many studies for the study of toxic or medicinal chemistry, which was first proposed in the early 1970s [90].…”
Section: Qsar Assaymentioning
confidence: 99%
“…All of the top-performing empirical methods were developed as commercial software that requires a license to run, and there were not any open-source alternatives for empirical p K a predictions. Since the completion of the blind challenge, two publications reported open-source machine learning-based p K a prediction methods, however, one can only predict the most acidic or most basic macroscopic p K a values of a molecule [54] and the second one is only trained for predicting p K a values of monoprotic molecules [55]. Recently, a p K a prediction methodology was published that describes a mixed approach of semi-empirical QM calculations and machine learning that can predict macroscopic p K a s of both mono- and polyprotic species [56].…”
Section: Resultsmentioning
confidence: 99%
“…In another study, three open-source QSAR models were built to predict the pKa of acids and bases: SVM in conjunction with k-NN, extreme gradient boosting and DNN [186]. PaDEL was used to produce molecular descriptors, fingerprints and fragment counts.…”
Section: Chemical Propertiesmentioning
confidence: 99%