2020
DOI: 10.26434/chemrxiv.13273310.v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Completion of Partial Reaction Equations

Abstract: We present a deep-learning model for inferring missing molecules in reaction equations. Such an algorithm features multiple interesting behaviors. First, it can infer the necessary reagents and solvents in chemical transformations specified only in terms of main compounds, as often resulting from retrosynthetic analyses. The completion with necessary reagents ensures that reaction equations are compatible with deep-learning models relying on a complete reaction specification. Second, it can cure existing datas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(10 citation statements)
references
References 17 publications
0
10
0
Order By: Relevance
“…Then, help may be provided by additional data-driven models, such as models providing the missing compounds in incomplete chemical equations. 50 Since SCINE Chemoton cannot steer an exploration into a desired direction based on the minimal input, the combinatorial explosion resulting from attempts to react every atom and atom pair in every molecule with every atom and atom pair in every other molecule will be computationally demanding and must be tamed. By providing Chemoton with additional information on a reactive system under consideration, the exploration can be made much more efficient.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Then, help may be provided by additional data-driven models, such as models providing the missing compounds in incomplete chemical equations. 50 Since SCINE Chemoton cannot steer an exploration into a desired direction based on the minimal input, the combinatorial explosion resulting from attempts to react every atom and atom pair in every molecule with every atom and atom pair in every other molecule will be computationally demanding and must be tamed. By providing Chemoton with additional information on a reactive system under consideration, the exploration can be made much more efficient.…”
Section: Methodsmentioning
confidence: 99%
“…With state of the art AI models, however, it is hard to determine byproducts automatically. In principle, the aforementioned chemical equation completion 50 is able to provide missing byproducts. In practice, however, data sets of chemical reactions do not provide this information, which is either unknown or of no practical interest to chemists.…”
Section: Methodsmentioning
confidence: 99%
“…Such sequence-2-sequence methods for forward prediction, retrosynthesis, and agent completion can be atom-mapping independent, as the reactant and product atoms do not have to be linked in the training reactions. 139 , 140 , 141
Figure 11 An example of a molecular transformer, which uses Smiles to represent and transform reactant and agent molecules into the product of the reaction, as used by Schwaller et al. 139 The tokenization of the Smiles is shown by the bold characters separated with spaces.
…”
Section: Reactionsmentioning
confidence: 99%
“…Unlike the existing approaches designed specically for reagent prediction, our formulation is not conned to a predened set of possible reagents and allows the prediction of reagents for arbitrary reaction types. Whereas in principle our model is not the rst transformer suitable for reagent prediction, 31,32 it is the rst one to be trained specically for reagent prediction in a machine translation setting. We also demonstrate that the reagent prediction model can be used to improve a product prediction model trained on the USPTO dataset.…”
Section: Related Workmentioning
confidence: 99%