2020
DOI: 10.26434/chemrxiv.9897365.v4
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Mapping the Space of Chemical Reactions using Attention-Based Neural Networks

Abstract: <div><div><div><p>Organic reactions are usually assigned to classes grouping reactions with similar reagents and mechanisms. Reaction classes facilitate communication of complex concepts and efficient navigation through chemical reaction space. However, the classification process is a tedious task, requiring the identification of the corresponding reaction class template via annotation of the number of molecules in the reactions, the reaction center and the distinction between reactant… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
72
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
4

Relationship

3
6

Authors

Journals

citations
Cited by 40 publications
(72 citation statements)
references
References 31 publications
0
72
0
Order By: Relevance
“…Model interpretation is a significant component in any ML study. 46 In this section, we demonstrate how DRACON learns and memorizes useful information from disconnected reaction graphs. First, we investigate the learned latent vector representations of reactions.…”
Section: Resultsmentioning
confidence: 99%
“…Model interpretation is a significant component in any ML study. 46 In this section, we demonstrate how DRACON learns and memorizes useful information from disconnected reaction graphs. First, we investigate the learned latent vector representations of reactions.…”
Section: Resultsmentioning
confidence: 99%
“…We selected the target molecules from the RetroBioCat's curated set of biocatalyzed pathways 30 based on the intersection between chemistry coverage in our data set ECREACT and the data set of RetroBioCat. In fact, the encoding of ECREACT and the RetroBioCat test set using rxnfp 31 shows that the RetroBioCat test set reactions are forming distinct clusters in the TMAP-embedded reaction space (Figure 13a), in which the fraction of nearest neighbors from the set itself is consistently higher compared to reactions from ECREACT (13b). This analysis highlights the different chemistry captured by the datasets and anticipates a poor performance for those Retrobiocat examples poorly covered in the ECREACT data set (more details can be found in the Method Section, see Figure S14 for a depiction of the reaction classes' statistics from Finnigan 30 ).…”
Section: Retrosynthesis Use-casesmentioning
confidence: 99%
“…We annotated reaction SMILES for each step of the biocatalytic cascades considered in the test set from Finnigan et al 18 , excluding solvents information. For each reaction SMILES we extracted fingerprints using rxnfp 31 and we computed among the knearest neighbors (k=10), the fraction of neighbors belonging to RetroBioCat test set. The visualization of the embedded reactions was generated using TMAP.…”
Section: Retrosynthesis Routes Predictionmentioning
confidence: 99%
“…Deep-learning models applied to chemical reactions have received much attention in recent years: from the design of algorithms for forward reaction prediction [1][2][3] and retrosynthetic analysis [1,4,5] that help chemists plan the design and execution of chemical syntheses, to the generation of reaction fingerprints [6] and prediction of reaction classes [7,6], yields [8], or activation energies [9].…”
Section: Introductionmentioning
confidence: 99%