2022
DOI: 10.1039/d1dd00052g
|View full text |Cite
|
Sign up to set email alerts
|

A transfer learning protocol for chemical catalysis using a recurrent neural network adapted from natural language processing

Abstract: Minimizing the time and material investments in discovering molecular catalysis would be immensely beneficial. Given the high contemporary importance of homogeneous catalysis in general, and asymmetric catalysis in particular, makes...

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5

Relationship

3
2

Authors

Journals

citations
Cited by 17 publications
(22 citation statements)
references
References 49 publications
0
22
0
Order By: Relevance
“…We have performed principal component analysis (PCA) on the encoder output (Figure 4b) followed by k-means clustering on the first two principal components. 34 Interestingly, distinct clusters were obtained in each reaction type. For example, structurally similar chiral ligands/catalysts in the case of the asymmetric hydrogenation reaction were found to remain in the same cluster.…”
Section: Modelsmentioning
confidence: 99%
See 3 more Smart Citations
“…We have performed principal component analysis (PCA) on the encoder output (Figure 4b) followed by k-means clustering on the first two principal components. 34 Interestingly, distinct clusters were obtained in each reaction type. For example, structurally similar chiral ligands/catalysts in the case of the asymmetric hydrogenation reaction were found to remain in the same cluster.…”
Section: Modelsmentioning
confidence: 99%
“…33 On this front, we have utilized the tools from the domain of NLP to predict % ee/yield of three diverse catalytic reactions (Figure 4a). 34 The diverse reaction types comprise both transition-metal-catalyzed and organocatalytic transformations with varying data sizes and distributions. SMILES strings were employed to express the molecules as linear strings of characters conducive for language models (LMs).…”
Section: Reaction Outcome Predictionmentioning
confidence: 99%
See 2 more Smart Citations
“…The use of DL for molecular discovery can broadly be categorized into reaction outcome prediction and molecule generation ( Gomez-Bombarelli et al., 2018 ; Miljkovic et al., 2021 ; Walters and Barzilay, 2021 ). The predictive DL models, consisting of multiple hidden layers, have been used for predictions of molecular properties as well as the yield/selectivities of reactions ( Schwaller et al., 2021 ; Senior et al., 2020 ; Singh and Sunoj, 2022 ). First, DL methods can take molecular structures to learn the data-driven feature representation with minimal feature engineering ( Winter et al., 2019 ; Li et al., 2021 ; Atz et al., 2021 ; Mi et al., 2021 ; Lim and Jung, 2019 ).…”
Section: Introductionmentioning
confidence: 99%