2017
DOI: 10.48550/arxiv.1711.04810
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

"Found in Translation": Predicting Outcomes of Complex Organic Chemistry Reactions using Neural Sequence-to-Sequence Models

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(9 citation statements)
references
References 0 publications
0
9
0
Order By: Relevance
“…In the broader context of Importantly, properties such as atomic number (Z) are often encoded using one hot vectors, which are binary, but for spatial efficiency, the integer is used in its place. The RNN model shows a simplified "many to many" recurrent network, with the text above and below the dashed lines indicating a stylized reaction prediction system inspired by the work of Schwaller et al 61 This system takes in reagent and agent SMILES and predicts the variable length product string; however, the LSTM architecture they used is significantly more complex than the one shown above. artificial intelligence, this means that these systems are weak AI, capable of solving only a single, extremely narrow task and not capable of meaningfully answering even slight deviations from the question it was trained on.…”
Section: ■ the Big Picturementioning
confidence: 99%
“…In the broader context of Importantly, properties such as atomic number (Z) are often encoded using one hot vectors, which are binary, but for spatial efficiency, the integer is used in its place. The RNN model shows a simplified "many to many" recurrent network, with the text above and below the dashed lines indicating a stylized reaction prediction system inspired by the work of Schwaller et al 61 This system takes in reagent and agent SMILES and predicts the variable length product string; however, the LSTM architecture they used is significantly more complex than the one shown above. artificial intelligence, this means that these systems are weak AI, capable of solving only a single, extremely narrow task and not capable of meaningfully answering even slight deviations from the question it was trained on.…”
Section: ■ the Big Picturementioning
confidence: 99%
“…Then a molecules, such as Benzene, is represented in SMILES notation as c1ccccc1. It has already been shown that SMILES representation of molecules has been effective in chemoinformatics [2,10,31,32]. This has strengthened our belief that recent advances in the field of deep computational linguistics and generative models might have an immense impact on prototype based drug development.…”
Section: Molecule Representationmentioning
confidence: 99%
“…Molecular raw data can be represented in few ways, and processed with different deep architectures. Among those we can find 2D/3D images served as input to a convolutional neural network (CNN) [9,35], molecular graph representation paired with neural graph embedding methods [5,39], and SMILES stringsmodeled as a language model with recurrent neural network (RNN) [2,10,31] [18] generative model, to learn dense molecular embedding space. At test time, the model is able to generate new molecules from samples of the prior distribution enforced on the latent representation during training.…”
Section: Related Workmentioning
confidence: 99%
“…With a similar goal, Jacob and Lapkin build a stochastic block model (SBM) for the classification of reactions into true or false using reactions in Reaxys (true) and ones randomly generated from known chemicals (false) [199]. Other machine learning-based methods include ones that rank enumerated mechanistic [200][201][202] or pseudo-mechanistic [203] steps, score/rank reaction templates [153,204], score/rank candidate products generated from reaction templates [205], propose reaction products as resulting from sets of graph edits [206,207], and translate reactant SMILES strings to product SMILES strings using models built for natural language processing tasks [208][209][210]. These all formulate reaction prediction differently; for example, the model in ref.…”
Section: Discovering Models Of Chemical Reactivitymentioning
confidence: 99%