2018
DOI: 10.1002/ange.201806920
|View full text |Cite
|
Sign up to set email alerts
|

Prediction of Major Regio‐, Site‐, and Diastereoisomers in Diels–Alder Reactions by Using Machine‐Learning: The Importance of Physically Meaningful Descriptors

Abstract: Machine learning can predict the major regio-, site-, and diastereoselective outcomes of Diels-Alder reactions better than standardq uantum-mechanical methods and with accuracies exceeding 90 %provided that i) the diene/dienophile substrates are represented by "physical-organic" descriptors reflecting the electronic and steric characteristics of their substituents and ii)t he positions of such substituents relative to the reaction core are encoded ("vectorized") in an informative way.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

1
19
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
7
2

Relationship

2
7

Authors

Journals

citations
Cited by 29 publications
(20 citation statements)
references
References 47 publications
1
19
0
Order By: Relevance
“…In that regard, adversarial controls should be designed to disprove alternative model performance hypotheses and distinguish between the exploitation of confounding variables/experimental artefacts and chemically meaningful patterns. Sensible control recommendations, paralleling good wet-laboratory practice, can be found elsewhere 88 and have actually shown utility 89 . In one instance, different dummy variable systems, decorrelated from chemical insight, were used to validate random forests as true high-performance classifiers of regioselectivity, site selectivity and diastereoselectivity in Diels-Alder reactions, only when a conjugation of electronic and steric indices are employed as features (74-83% accuracy for dummy variables versus 93% accuracy for Hammettsteric descriptors) 89 .…”
Section: Softmax Layermentioning
confidence: 99%
“…In that regard, adversarial controls should be designed to disprove alternative model performance hypotheses and distinguish between the exploitation of confounding variables/experimental artefacts and chemically meaningful patterns. Sensible control recommendations, paralleling good wet-laboratory practice, can be found elsewhere 88 and have actually shown utility 89 . In one instance, different dummy variable systems, decorrelated from chemical insight, were used to validate random forests as true high-performance classifiers of regioselectivity, site selectivity and diastereoselectivity in Diels-Alder reactions, only when a conjugation of electronic and steric indices are employed as features (74-83% accuracy for dummy variables versus 93% accuracy for Hammettsteric descriptors) 89 .…”
Section: Softmax Layermentioning
confidence: 99%
“…We opted for physical organic chemistry features which would be chemically understandable and transferable to other reactions. 28 We selected features associated with nucleophilicity, electrophilicity, sterics, dispersion and bonding as well as features describing the solvent.…”
Section: Reaction Feature Generationmentioning
confidence: 99%
“…Diverse domains of modern science have already embraced the benefits of machine learning (ML) with an impressive degree of success (7)(8)(9)(10)(11). There have also been some important applications of ML in chemistry at large and catalytic reactions in particular (12)(13)(14)(15)(16)(17)(18)(19)(20)(21)(22)(23)(24)(25)(26)(27), such as for predicting selectivities (28)(29)(30)(31)(32)(33)(34)(35)(36). We intend to design a practically useful ML protocol to be deployed for asymmetric hydrogenation of substrates bearing C = C and C = N bonds using axially chiral binaphthyl-derived catalyst families.…”
mentioning
confidence: 99%