2021
DOI: 10.1016/j.isci.2020.101961
|View full text |Cite
|
Sign up to set email alerts
|

Pushing the limits of solubility prediction via quality-oriented data selection

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
36
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2
1
1

Relationship

3
5

Authors

Journals

citations
Cited by 44 publications
(36 citation statements)
references
References 38 publications
0
36
0
Order By: Relevance
“…The AqSolPred model, which was used for solubility predictions in the current work, had previously been validated on a benchmark solubility dataset 24 . The model has a Mean Absolute Error of 0.348 LogS, which is lower than the conventional cheminformatics and ML methods that are ordinarily used for the prediction of aqueous solubility of chemical species 13 .…”
Section: Validation Of Solubility Predictionsmentioning
confidence: 89%
See 2 more Smart Citations
“…The AqSolPred model, which was used for solubility predictions in the current work, had previously been validated on a benchmark solubility dataset 24 . The model has a Mean Absolute Error of 0.348 LogS, which is lower than the conventional cheminformatics and ML methods that are ordinarily used for the prediction of aqueous solubility of chemical species 13 .…”
Section: Validation Of Solubility Predictionsmentioning
confidence: 89%
“…It is an exemplary resource on quinone and aza-aromatic electroactive compounds as it contains several candidate molecules for batteries that are worthy of experimental investigation. The database contains comprehensive data that has been systematically collected by using the state-of-the-art computational procedures 1112 and data-driven methods 13 . Therefore, it's also useful for other applications beyond ARFBs for which the intriguing chemistry of these molecules matter.…”
Section: Background and Summarymentioning
confidence: 99%
See 1 more Smart Citation
“…If it is true that the quality of QSPR models is directly influenced by the quality of the datasets on which they are trained, it is equally true that the assessment of predictive performances of a model is strongly influenced by the accuracy of the data on which it is tested. Indeed, the observed performances derive from the contribution of the actual predictive performances (defined as the accuracy of a model that would be observed on a test set with zero internal error) and the uncertainty in the test data [ 38 ]. In the case of the intrinsic solubility, due to poor reliability of experimental solubility values, the observed performances could be significantly influenced by the errors affecting the logS 0 values of the test compounds.…”
Section: Discussionmentioning
confidence: 99%
“…29,30 Accordingly, the linear regression equations used for predicting the experimental redox potentials of the reduction of C=O groups on quinone-like 29 The aqueous solubility data of the reactant molecules was obtained by using AqSolPred v1.0. 31 , which is a state-of-the-art consensus ML model that had been trained on the largest open-source measured solubility dataset of chemical compounds. 20 In this context, the approach employed here is in stark contrast to other HTVS efforts that make use of the computed aqueous solvation free energy, ∆ 8"9: " , as an approximate descriptor for the solubility of compounds.…”
Section: Physics-based and Data-driven Modellingmentioning
confidence: 99%