2021
DOI: 10.1186/s13321-021-00497-0
|View full text |Cite
|
Sign up to set email alerts
|

Molecular optimization by capturing chemist’s intuition using deep neural networks

Abstract: A main challenge in drug discovery is finding molecules with a desirable balance of multiple properties. Here, we focus on the task of molecular optimization, where the goal is to optimize a given starting molecule towards desirable properties. This task can be framed as a machine translation problem in natural language processing, where in our case, a molecule is translated into a molecule with optimized properties based on the SMILES representation. Typically, chemists would use their intuition to suggest ch… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
84
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2

Relationship

2
5

Authors

Journals

citations
Cited by 76 publications
(100 citation statements)
references
References 33 publications
(20 reference statements)
0
84
0
Order By: Relevance
“…This nature of the modelling objective affects the choices made when preparing the validation set. While it is common to randomly hold out a portion of the training data and use these for evaluations 32 , the sliced dataset used here does not lend itself well to this approach. To be able to fairly judge the generalization ability of the model on previously unseen scaffolds, it is necessary to ensure that the validation scaffolds are not present in the training dataset.…”
Section: Validation Setmentioning
confidence: 99%
“…This nature of the modelling objective affects the choices made when preparing the validation set. While it is common to randomly hold out a portion of the training data and use these for evaluations 32 , the sliced dataset used here does not lend itself well to this approach. To be able to fairly judge the generalization ability of the model on previously unseen scaffolds, it is necessary to ensure that the validation scaffolds are not present in the training dataset.…”
Section: Validation Setmentioning
confidence: 99%
“…Following [25], the SMILES representation of molecule and the Transformer model from NLP is used in our study. The Transformer is trained on a set of molecular pairs together with the property changes between source and target molecules.…”
Section: Transformer Neural Networkmentioning
confidence: 99%
“…Figure 1 shows an example of source and target sequences which are fed into the Transformer model during training. More details can be found in [25].…”
Section: Transformer Neural Networkmentioning
confidence: 99%
See 2 more Smart Citations