2020
DOI: 10.1038/s41467-020-19594-z
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning with physicochemical relationships: solubility prediction in organic solvents and water

Abstract: Solubility prediction remains a critical challenge in drug development, synthetic route and chemical process design, extraction and crystallisation. Here we report a successful approach to solubility prediction in organic solvents and water using a combination of machine learning (ANN, SVM, RF, ExtraTrees, Bagging and GP) and computational chemistry. Rational interpretation of dissolution process into a numerical problem led to a small set of selected descriptors and subsequent predictions which are independen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
216
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 195 publications
(220 citation statements)
references
References 44 publications
(50 reference statements)
4
216
0
Order By: Relevance
“…By inputting ALD in accordance with the atomic sequence in the input le, it is accessible to reconstruct chemical properties of atoms in the ML models. Boobier and co-workers also extracted atomic features in predicting the solubility of solutes in different solvents, but the features of the same elements were summed as descriptors nally 31 . We think this method will lead to the loss of atomic information by virtue of different atomic positions in the molecular structure.…”
Section: Resultsmentioning
confidence: 99%
“…By inputting ALD in accordance with the atomic sequence in the input le, it is accessible to reconstruct chemical properties of atoms in the ML models. Boobier and co-workers also extracted atomic features in predicting the solubility of solutes in different solvents, but the features of the same elements were summed as descriptors nally 31 . We think this method will lead to the loss of atomic information by virtue of different atomic positions in the molecular structure.…”
Section: Resultsmentioning
confidence: 99%
“…Sixty five structural group descriptors were used to generate the models. In 2020, Boobier et al proposed to develop models by using MLR, PLS, shallow neural networks, SVM, GP, RF, and ET for solubility prediction in water, ethanol, acetone, and benzene [ 31 ]. The models were constructed on 695 ethanol data, 464 benzene data, and 452 acetone data.…”
Section: Discussionmentioning
confidence: 99%
“…Machine learning algorithms have also been employed to construct models for solubility prediction in various organic solvents [ 7 , 30 , 31 ]. The models were generated on experimental solubility data.…”
Section: Introductionmentioning
confidence: 99%
“…Machine Learning (ML) and Artificial Intelligence (AI) has gained tremendous importance and applications in the chemical industries ( Boobier et al., 2020 ; Dimiduk et al., 2018 ; Ge et al., 2017 ; Liu et al., 2017 ; Shang and You, 2019 ) They are emerging fields that can transform solvent and technology selection. In recent times, ML algorithms have been used to predict the physicochemical properties of organic solvents.…”
Section: Emerging Trends In Designing Solvent Recoverymentioning
confidence: 99%
“…In recent times, ML algorithms have been used to predict the physicochemical properties of organic solvents. An example is the prediction of the solubility of organic solvents in water presented by Boobier et al. (2020) .…”
Section: Emerging Trends In Designing Solvent Recoverymentioning
confidence: 99%