2023
DOI: 10.1002/minf.202300288
|View full text |Cite
|
Sign up to set email alerts
|

Chemical language models for molecular design

Jürgen Bajorath

Abstract: In drug discovery, chemical language models (CLMs) originating from natural language processing offer new opportunities for molecular design. CLMs have been developed using recurrent neural network (RNN) or transformer architectures. For the predictive performance of RNN‐based encoder‐decoder frameworks and transformers, attention mechanisms play a central role. Among others, emerging application areas for CLMs include constrained generative modeling and the prediction of chemical reactions or drug‐target inte… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2025
2025

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 36 publications
0
2
0
Order By: Relevance
“…De Novo Drug Design by Chemical Language Models. A focused virtual chemical library for PI3Kγ was constructed by a chemical language model (CLM) 107,108 in two steps. In the first step, the CLM was pretrained with an autoregressive approach to capture features related to 839 674 patented compounds.…”
Section: 2mentioning
confidence: 99%
“…De Novo Drug Design by Chemical Language Models. A focused virtual chemical library for PI3Kγ was constructed by a chemical language model (CLM) 107,108 in two steps. In the first step, the CLM was pretrained with an autoregressive approach to capture features related to 839 674 patented compounds.…”
Section: 2mentioning
confidence: 99%
“…Such models can be derived to learn a variety of sequence-to-sequence mappings for diverse design tasks, giving rise to different types of chemical language models (CLMs). 6–14 Popular DNN architectures from NLP include recurrent neural networks (RNNs), which were first adapted for applications in chemistry, 4–7 and different transformer networks that are increasingly used for molecular design and de novo compound generation. 10–15 Different deep generative models including transformers and diffusion models have also been developed in structure-based drug design for the discovery and optimization of new active compounds.…”
Section: Introductionmentioning
confidence: 99%