2022
DOI: 10.26434/chemrxiv-2022-0hl5p-v3
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SolvBERT for solvation free energy and solubility prediction: a demonstration of an NLP model for predicting the properties of molecular complexes

Abstract: Deep learning models based on NLP, mainly the Transformer family, have been successfully applied to solve many chemistry-related problems, but their applications are mostly limited to chemical reactions. Meanwhile, solvation is an important concept in physical and organic chemistry, describing the interaction of solutes and solvents. This interaction leads to a solvation complex, a molecular complex similar to a reactant-reagent complex. In this study, we introduced the SolvBERT model, which reads the solute a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 4 publications
0
2
0
Order By: Relevance
“…[36][37][38] For more rapid and accurate solubility predictions, various predictive models have been actively developed by analyzing quantitative structure-property relationship (QSPR) 34,[39][40][41][42][43][44] or adopting machine learning (ML) techniques. 34,42,[45][46][47][48][49][50][51][52][53][54][55][56] Particularly, current advanced ML models used graph neural networks (GNNs) combined with interaction layers 47,53,57 recurrent neural networks with attention layers, 45 and natural language processing-based transformers. 54,58 These models achieved accuracies close to experimental uncertainties.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…[36][37][38] For more rapid and accurate solubility predictions, various predictive models have been actively developed by analyzing quantitative structure-property relationship (QSPR) 34,[39][40][41][42][43][44] or adopting machine learning (ML) techniques. 34,42,[45][46][47][48][49][50][51][52][53][54][55][56] Particularly, current advanced ML models used graph neural networks (GNNs) combined with interaction layers 47,53,57 recurrent neural networks with attention layers, 45 and natural language processing-based transformers. 54,58 These models achieved accuracies close to experimental uncertainties.…”
Section: Introductionmentioning
confidence: 99%
“…34,42,[45][46][47][48][49][50][51][52][53][54][55][56] Particularly, current advanced ML models used graph neural networks (GNNs) combined with interaction layers 47,53,57 recurrent neural networks with attention layers, 45 and natural language processing-based transformers. 54,58 These models achieved accuracies close to experimental uncertainties. Furthermore, the development of ML models has been expanded to the prediction of solubility limits at different temperatures, 52 solvation enthalpy, LSER, and solute parameters, 51 and generative models for designing molecules having optimal aqueous solubility.…”
Section: Introductionmentioning
confidence: 99%