2021
DOI: 10.26434/chemrxiv.13638347
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep Generative Models Enable Navigation in Sparsely Populated Chemical Space

Abstract: Deep generative models are powerful tools for the exploration of chemical space, enabling the on-demand gener- ation of molecules with desired physical, chemical, or biological properties. However, these models are typically thought to require training datasets comprising hundreds of thousands, or even millions, of molecules. This per- ception limits the application of deep generative models in regions of chemical space populated by only a small number of examples. Here, we systematically evaluate and optimize… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
2
0
1

Year Published

2021
2021
2023
2023

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 30 publications
0
2
0
1
Order By: Relevance
“…Generative deep learning, [1, 2] that is, a class of machine learning models able to generate new data, can be applied to computationally design pharmacologically active compounds de novo [3–5] . Deep learning‐based molecular design algorithms can extract high‐level molecular features from “raw” molecular representations, [6–10] such as molecular graphs and the Simplified Molecular Input Line Entry System (SMILES, Figure 1 a), [11] potentially allowing them to access unexplored regions of the chemical space [12] . Previous studies showed that chemical language models (CLMs), [13, 14] in particular generative deep learning models trained on SMILES strings, can generate novel molecules with experimentally validated bioactivity [9, 15, 16] .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Generative deep learning, [1, 2] that is, a class of machine learning models able to generate new data, can be applied to computationally design pharmacologically active compounds de novo [3–5] . Deep learning‐based molecular design algorithms can extract high‐level molecular features from “raw” molecular representations, [6–10] such as molecular graphs and the Simplified Molecular Input Line Entry System (SMILES, Figure 1 a), [11] potentially allowing them to access unexplored regions of the chemical space [12] . Previous studies showed that chemical language models (CLMs), [13, 14] in particular generative deep learning models trained on SMILES strings, can generate novel molecules with experimentally validated bioactivity [9, 15, 16] .…”
Section: Introductionmentioning
confidence: 99%
“…[ 3 , 4 , 5 ] Deep learning‐based molecular design algorithms can extract high‐level molecular features from “raw” molecular representations,[ 6 , 7 , 8 , 9 , 10 ] such as molecular graphs and the Simplified Molecular Input Line Entry System (SMILES, Figure 1 a ), [11] potentially allowing them to access unexplored regions of the chemical space. [12] Previous studies showed that chemical language models (CLMs),[ 13 , 14 ] in particular generative deep learning models trained on SMILES strings, can generate novel molecules with experimentally validated bioactivity. [ 9 , 15 , 16 ] CLMs have shown the ability to learn focused chemical features from small collections of template molecules by means of transfer learning, that is, a method to reuse previously learned knowledge on a new task for which the available data is scarce.…”
Section: Introductionmentioning
confidence: 99%
“…Deep‐Learning‐basierte Algorithmen für das Moleküldesign können spezifische chemische Merkmale aus “rohen” Moleküldarstellungen, wie z. B. molekularen Graphen und dem Simplified Molecular Input Line Entry System (SMILES, Abbildung 1 a) [11] extrahieren, [6–10] was ihnen potenziell den Zugang zu unerforschten Regionen des chemischen Raums ermöglicht [12] . Frühere Studien haben gezeigt, dass chemische Sprachmodelle (Chemical Language Models, CLMs), [13, 14] insbesondere auf SMILES‐Strings trainierte generative Deep‐Learning‐Modelle, neuartige Moleküle mit experimentell validierter biologischer Aktivität generieren können [9, 15, 16] .…”
Section: Introductionunclassified