2021
DOI: 10.33774/chemrxiv-2021-mc870
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Reaction Classification and Yield Prediction using the Differential Reaction Fingerprint DRFP

Abstract: Predicting the nature and outcome of reactions using computational methods is a crucial tool to accelerate chemical research. The recent application of deep learningbased learned fingerprints to reaction classification and reaction yield prediction has shown an impressive increase in performance compared to previous methods such as DFT-and structure-based fingerprints. However, learned fingerprints require large training data sets, are inherently biased, and are based on complex deep learning architectures. He… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
17
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(18 citation statements)
references
References 19 publications
1
17
0
Order By: Relevance
“…It is noteworthy that except for the descriptors about the reactants, the dipole moments of the solvents also remarkably affect the reaction yield. Reaxtica achieved better performance (R 2 = 0.87) than the DL-based method rxnfp 26 or fingerprint-based DRFP 27 using the HTE dataset. We further divided the USPTO and Reaxys dataset into 8:1:1 for training, validation and test, respectively.…”
Section: Yield Of Suzuki-miyaura Coupling Reactionmentioning
confidence: 94%
See 1 more Smart Citation
“…It is noteworthy that except for the descriptors about the reactants, the dipole moments of the solvents also remarkably affect the reaction yield. Reaxtica achieved better performance (R 2 = 0.87) than the DL-based method rxnfp 26 or fingerprint-based DRFP 27 using the HTE dataset. We further divided the USPTO and Reaxys dataset into 8:1:1 for training, validation and test, respectively.…”
Section: Yield Of Suzuki-miyaura Coupling Reactionmentioning
confidence: 94%
“…23 Regarding to SMILES-based models, Schwaller et al have applied natural language processing technology in reaction outcomes predictions, 24 which also extended to the selectivity of carbohydrate reactions 25 and reaction yield. 26,27 These DL-based models worked well in forward reaction outcome prediction in USPTO and HTE datasets. However, most of them are still black-box models that do not generalize well.…”
Section: Introductionmentioning
confidence: 91%
“…[15] The reactants and product fingerprint consist of a 512-bit extended connectivity fingerprints with a radius of 3, [16] concatenated to a 512-bit RDKit fingerprints with a maximum path length of 7. [17] Other featurization schemes of chemical reactions have been suggested recently, [18,19] but their use is outside the scope of our study.…”
Section: Methodsmentioning
confidence: 99%
“…Modeling the yield of these datasets (4K C-N couplings 15 , or 2K Suzuki-Miyaura couplings in flow 14 ) produces predictive models with R 2 or AUROC > 0.9. 11,15,[27][28][29][30][31][32][33][34] However, models trained on these datasets demonstrate limited ability to extrapolate beyond the molecules in their training sets, in part due to the minimal structural diversity in the dataset.…”
Section: Introductionmentioning
confidence: 99%