2019
DOI: 10.1002/minf.201800163
|View full text |Cite
|
Sign up to set email alerts
|

Predicting pKa for Small Molecules on Public and In‐house Datasets Using Fast Prediction Methods Combined with Data Fusion

Abstract: Data fusion approach was investigated in the context of pK a prediction for 391 small molecules derived from a public data source as well as for 681 compounds from an internal corporate database. Four different pKa prediction methods (Simulations Plus ADMET-Predictor S + pKa, ACD/Labs Percepta Classic, ACD/Labs Percepta GALAS and Epik) were used to predict the most acidic or basic pKa for each of the compounds. By using data fusion, the median absolute error for the internal compounds was reduced from the best… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 18 publications
0
6
0
Order By: Relevance
“…In this regard and to rationalize our previous results, and more importantly for the development of new ones, we have performed an in silico study using ACD/PhysChem Suite software and the p K a GALAS algorithm available in it to calculate the acid ionization constant values of various oximes and other additives (Table ). Like the p K a Classic method, which is a variation of a classical Hammett–Taft approach and is available as an alternative within the said software, the GALAS algorithm is based on analogous fundamental considerations. However, instead of largely relying on equations and parameters quantified by other authors, it is developed entirely in-house by ACD/Labs, parameterized “from scratch” using an internal training set of >18 000 compounds with available experimental p K a measurement data. The custom nature of the p K a GALAS model allows for greater flexibility in using various ad hoc adjustments and modifications, going beyond the scope of the concepts considered in the classic Hammett–Taft approach where needed.…”
Section: Resultsmentioning
confidence: 99%
“…In this regard and to rationalize our previous results, and more importantly for the development of new ones, we have performed an in silico study using ACD/PhysChem Suite software and the p K a GALAS algorithm available in it to calculate the acid ionization constant values of various oximes and other additives (Table ). Like the p K a Classic method, which is a variation of a classical Hammett–Taft approach and is available as an alternative within the said software, the GALAS algorithm is based on analogous fundamental considerations. However, instead of largely relying on equations and parameters quantified by other authors, it is developed entirely in-house by ACD/Labs, parameterized “from scratch” using an internal training set of >18 000 compounds with available experimental p K a measurement data. The custom nature of the p K a GALAS model allows for greater flexibility in using various ad hoc adjustments and modifications, going beyond the scope of the concepts considered in the classic Hammett–Taft approach where needed.…”
Section: Resultsmentioning
confidence: 99%
“…Data fusion for prediction of pKa, in which the public data sources as well as an internal corporate database of small molecules are fused by using four different prediction models such as Simulations Plus, ADMET‐Predictor, ACD/Labs Percepta Classic, ACD/Labs Percepta GALAS and Epik. This new data fusion approach improves accuracy due to reduced median absolute errors of the internal compounds [121] . The permeability of curcumin in Caco‐ 2 cell monolayers and the poor absorption of curcumin through the gut lumen were reported due to degradation in the HBSS buffer across the Caco‐2 cell model during transportation [122] …”
Section: Insights Of Pka and Logp Of The Curcumin Prodrugs Or Derivat...mentioning
confidence: 99%
“…This new data fusion approach improves accuracy due to reduced median absolute errors of the internal compounds. [121] The permeability of curcumin in Caco-2 cell monolayers and the poor absorption of curcumin through the gut lumen were reported due to degradation in the HBSS buffer across the Caco-2 cell model during transportation. [122] The energy distribution pattern of the curcumin was predicted by using Gaussview by using B3LYP/6-311 + + (d, p) model.…”
Section: Insights Of Pka and Logp Of The Curcumin Prodrugs Or Derivat...mentioning
confidence: 99%
“…Thus, to understand the drug-like properties exhibited by a large series of molecules, the easiest and least expensive method involves prediction in silico, and the best compounds can be chosen as candidates and compounds with poor drug-like properties can be removed [18,19]. Thereafter, prediction software can also exhibit large deviations and interfere with reaching valid conclusions [20]. Thus, the best candidates need to be accurately measured before entering rigorous preclinical research [21].…”
Section: Introductionmentioning
confidence: 99%