2022
DOI: 10.1021/acsomega.2c00642
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of Deep Learning Architectures for Aqueous Solubility Prediction

Abstract: Determining the aqueous solubility of molecules is a vital step in many pharmaceutical, environmental, and energy storage applications. Despite efforts made over decades, there are still challenges associated with developing a solubility prediction model with satisfactory accuracy for many of these applications. The goals of this study are to assess current deep learning methods for solubility prediction, develop a general model capable of predicting the solubility of a broad range of organic molecules, and to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
26
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 43 publications
(41 citation statements)
references
References 71 publications
0
26
0
Order By: Relevance
“…These potential outliers could be handled well by tree-based algorithms but may affect the performance of DNN greatly. Many studies using other representations of a molecule, such as weave module, 40 SMILES string, 41 and molecular graph 41,42 achieved generally a performance similar to that of Morgan fingerprint or 2D/3D descriptors on the prediction of solubility or other physicochemical properties. These new representations of molecules were not applied in this study.…”
Section: ■ Resultsmentioning
confidence: 99%
“…These potential outliers could be handled well by tree-based algorithms but may affect the performance of DNN greatly. Many studies using other representations of a molecule, such as weave module, 40 SMILES string, 41 and molecular graph 41,42 achieved generally a performance similar to that of Morgan fingerprint or 2D/3D descriptors on the prediction of solubility or other physicochemical properties. These new representations of molecules were not applied in this study.…”
Section: ■ Resultsmentioning
confidence: 99%
“…Indeed, a wide-variety of deep learning architectures have been applied to the problem of predicting solubility, including graph-based neural networks, recurrent neural networks, transformers, , message-passing neural networks, deep belief networks, and others. Unlike other fields, it is not yet clear that deep learning algorithms offer a significant improvement over traditional machine learning approaches for solubility prediction . This may partly be due to a lack of accurate experimental solubility measurements for training, although strategies such as transfer learning , or multitask learning may help in some cases.…”
Section: Introductionmentioning
confidence: 99%
“…Unlike other fields, it is not yet clear that deep learning algorithms offer a significant improvement over traditional machine learning approaches for solubility prediction . This may partly be due to a lack of accurate experimental solubility measurements for training, although strategies such as transfer learning , or multitask learning may help in some cases.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The “transfer learning” approach has been extensively tested in studies devoted to the prediction of the properties of small molecules. Various DNN have appeared to be more accurate if they have been pretrained in advance on large amounts of “synthetic” (i.e., theoretically calculated) data, and only then fine-tuned with other more precise computational or experimental data. For small molecules, these results were obtained using specifically developed quantum-chemical databases comprising up to hundreds of thousands of compounds and their properties (including, for example, the well-known quantum-chemical QM9 database, the Materials Project database, and the Open Quantum Materials Database).…”
Section: Introductionmentioning
confidence: 99%