Prediction of p<i>K</i><sub>a</sub> Using Machine Learning Methods with Rooted Topological Torsion Fingerprints: Application to Aliphatic Amines

Lu, Yuele; Anand, Shankara; Shirley, William A.; Gedeck, Peter; Kelley, Brian P.; Skolnik, Suzanne; Rodde, Stephane; Nguyen, Mai P.; Lindvall, Mika K.; Wang, Jia

doi:10.1021/acs.jcim.9b00498

Cited by 36 publications

(30 citation statements)

References 72 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…5,6,14 The LFER models apply the Hammett equations to predict pK a by classifying the molecule to a parent class and modifying the pK a value of the parent class with a property of its substituents. Machine learning models 11,12,15 usually use a molecular environment rooted at the ionization center as the descriptor to develop the pK a prediction approach by learning from data.…”

Section: ■ Introductionmentioning

confidence: 99%

MolGpka: A Web Server for Small Molecule pK_a Prediction Using a Graph-Convolutional Neural Network

Pan

Wang

Li³

et al. 2021

J. Chem. Inf. Model.

130

102

View full text Add to dashboard Cite

pK a is an important property in the lead optimization process since the charge state of a molecule in physiologic pH plays a critical role in its biological activity, solubility, membrane permeability, metabolism, and toxicity. Accurate and fast estimation of small molecule pK a is vital during the drug discovery process. We present MolGpKa, a web server for pK a prediction using a graph-convolutional neural network model. The model works by learning pK a related chemical patterns automatically and building reliable predictors with learned features. ACD/pK a data for 1.6 million compounds from the ChEMBL database was used for model training. We found that the performance of the model is better than machine learning models built with human-engineered fingerprints. Detailed analysis shows that the substitution effect on pK a is well learned by the model. MolGpKa is a handy tool for the rapid estimation of pK a during the ligand design process. The MolGpKa server is freely available to researchers and can be accessed at https://xundrug.cn/molgpka.

show abstract

Section: ■ Introductionmentioning

confidence: 99%

MolGpka: A Web Server for Small Molecule pK_a Prediction Using a Graph-Convolutional Neural Network

Pan

Wang

Li³

et al. 2021

J. Chem. Inf. Model.

130

102

View full text Add to dashboard Cite

show abstract

“…In reaction prediction, ML algorithms have proven helpful in identifying the most likely types of reactions applicable to a given substrate under given reaction conditions, [1f, 4, 6e] and in the choice of site‐ or regioisomers that can form [7] . For relatively simple substrates and non‐stereoselective chemistries with sufficient numbers of literature precedents, the accuracy of these models has been satisfactory, reflecting the adequacy of molecular descriptors embodying information about atomic composition and connectivity (various 2D and 3D fingerprints, [8a–d] or descriptor libraries like DScribe [8e] ), electronic effects of substituents (e.g., Hammett constants [7a] or QM‐derived measures [9] ), as well as some measures of steric bulk in the vicinity of reaction center (e.g., TSEI indices we used to predict the outcomes of Diels Alder reactions [7a] ). Simultaneously, there has been progress in developing predictors capturing stereochemical information [1f, 10] and in predicting outcomes of stereoselective reactions controlled by chiral catalysts (cf.…”

Section: Figurementioning

confidence: 99%

Scaffold‐Directed Face Selectivity Machine‐Learned from Vectors of Non‐covalent Interactions

et al. 2021

View full text Add to dashboard Cite

This work describes a method to vectorize and Machine-Learn, ML, non-covalent interactions responsible for scaffold-directed reactions important in synthetic chemistry. Models trained on this representation predict correct face of approach in ca. 90 % of Michael additions or Diels-Alder cycloadditions. These accuracies are significantly higher than those based on traditional ML descriptors, energetic calculations, or intuition of experienced synthetic chemists. Our results also emphasize the importance of ML models being provided with relevant mechanistic knowledge; without such knowledge, these models cannot easily "transfer-learn" and extrapolate to previously unseen reaction mechanisms.

show abstract

“…[22][23][24][25][26] Applications of SVMs in chemistry include bioactivity prediction, toxicity-related properties and physicochemical property prediction. 1,[26][27][28][29] A dataset consisting of chemical structures or reactions must converted to a machine readable format before presented to a machine learning algorithm. Molecular descriptors are based on the structural, physiochemical, electronic, or topological nature of molecules.…”

Section: Introductionmentioning

confidence: 99%

“…40 Fingerprints have also been utilised in kernel-based QSAR/QSPR relationship models, using the Tanimoto or RBF kernel. [27][28][29] Molecular graphs are another two-dimensional representation that depict the atoms and bonds within molecules as a set of nodes and edges. The global molecular structure is considered, in contrast to the local environments in fingerprints.…”

Section: Introductionmentioning

confidence: 99%

Kernel Methods for Predicting Yields of Chemical Reactions

Haywood¹,

Redshaw²,

Hanson‐Heine³

et al. 2021

Preprint

View full text Add to dashboard Cite

The use of machine learning methods for the prediction of reaction yield is an emerging area. We demonstrate the applicability of support vector regression (SVR) for predicting reaction yields, using combinatorial data. Molecular descriptors used in regression tasks related to chemical reac?tivity have often been based on time-consuming, computationally demanding quantum chemical calculations, usually density functional theory. Structure-based descriptors (molecular fingerprints and molecular graphs) are quicker and easier to calculate, and are applicable to any molecule. In this study, SVR models built on structure-based descriptors were compared to models built on quantum chemical descriptors. The models were evaluated along the dimension of each reaction component in a set of Buchwald-Hartwig amination reactions. The structure-based SVR models out-performed the quantum chemical SVR models, along the dimension of each reaction compo?nent. The applicability of the models was assessed with respect to similarity to training. Prospec?tive predictions of unseen Buchwald-Hartwig reactions are presented for synthetic assessment, to validate the generalisability of the models, with particular interest along the aryl halide dimension.

show abstract

Prediction of pK_a Using Machine Learning Methods with Rooted Topological Torsion Fingerprints: Application to Aliphatic Amines

Cited by 36 publications

References 72 publications

MolGpka: A Web Server for Small Molecule pK_a Prediction Using a Graph-Convolutional Neural Network

MolGpka: A Web Server for Small Molecule pK_a Prediction Using a Graph-Convolutional Neural Network

Scaffold‐Directed Face Selectivity Machine‐Learned from Vectors of Non‐covalent Interactions

Kernel Methods for Predicting Yields of Chemical Reactions

Contact Info

Product

Resources

About

Prediction of pKa Using Machine Learning Methods with Rooted Topological Torsion Fingerprints: Application to Aliphatic Amines

Cited by 36 publications

References 72 publications

MolGpka: A Web Server for Small Molecule pKa Prediction Using a Graph-Convolutional Neural Network

MolGpka: A Web Server for Small Molecule pKa Prediction Using a Graph-Convolutional Neural Network

Scaffold‐Directed Face Selectivity Machine‐Learned from Vectors of Non‐covalent Interactions

Kernel Methods for Predicting Yields of Chemical Reactions

Contact Info

Product

Resources

About

Prediction of pK_a Using Machine Learning Methods with Rooted Topological Torsion Fingerprints: Application to Aliphatic Amines

MolGpka: A Web Server for Small Molecule pK_a Prediction Using a Graph-Convolutional Neural Network

MolGpka: A Web Server for Small Molecule pK_a Prediction Using a Graph-Convolutional Neural Network