2023
DOI: 10.26434/chemrxiv-2023-gnzpf
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Transfer Learning for a Foundational Chemistry Model

Emma King-Smith

Abstract: Data-driven chemistry has garnered much interest concurrent with improvements in hardware and the development of new machine learning models. However, a notable bottleneck for data-driven chemistry specifically is the challenge in obtaining sufficiently large, accurate datasets of a desired chemical outcome. Herein, I develop a machine learning framework that makes prediction amid low data: First, a chemical “foundational model” is trained using on a dataset of ~1 million experimental organic crystal structure… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 29 publications
0
2
0
Order By: Relevance
“…DL methods are capable of transfer learning, a technique commonly used in low-data settings, in which the model is pre-trained with more readily available data that provides the model with implicit information about the main task. 220,225,226 By leveraging the libraries of computational results, and historical empirical results, models can be preconditioned with physicochemical information for SDL campaigns that typically start in the low-data regime. Such models can even be used to encode chemical compounds as task-specific descriptors, compressing the chemical information into expressive abstract representations.…”
Section: Role Of Artificial Intelligence In Chemical Discoverymentioning
confidence: 99%
“…DL methods are capable of transfer learning, a technique commonly used in low-data settings, in which the model is pre-trained with more readily available data that provides the model with implicit information about the main task. 220,225,226 By leveraging the libraries of computational results, and historical empirical results, models can be preconditioned with physicochemical information for SDL campaigns that typically start in the low-data regime. Such models can even be used to encode chemical compounds as task-specific descriptors, compressing the chemical information into expressive abstract representations.…”
Section: Role Of Artificial Intelligence In Chemical Discoverymentioning
confidence: 99%
“…It has gained popularity, as it implements a modern approach to describing polycyclic hydrocarbons formation, and can also be supplemented with NO x and soot formation reaction submodules. The performance of the mechanisms on methane-hydrogen mixtures autoignition was also compared with other classical kinetic schemes-GRI-Mech 3.0, which has been the industry standard for the last two decades [27]; FFCM-1, resulting from modern efforts of global constrained optimization within the uncertainties of reaction rate parameters [28]; and Aramco 2.0 [29].…”
Section: Modelingmentioning
confidence: 99%