Abstract:In drug discovery, chemical language models (CLMs) originating from natural language processing offer new opportunities for molecular design. CLMs have been developed using recurrent neural network (RNN) or transformer architectures. For the predictive performance of RNN‐based encoder‐decoder frameworks and transformers, attention mechanisms play a central role. Among others, emerging application areas for CLMs include constrained generative modeling and the prediction of chemical reactions or drug‐target inte… Show more
“…De Novo Drug Design by Chemical Language Models. A focused virtual chemical library for PI3Kγ was constructed by a chemical language model (CLM) 107,108 in two steps. In the first step, the CLM was pretrained with an autoregressive approach to capture features related to 839 674 patented compounds.…”
“…De Novo Drug Design by Chemical Language Models. A focused virtual chemical library for PI3Kγ was constructed by a chemical language model (CLM) 107,108 in two steps. In the first step, the CLM was pretrained with an autoregressive approach to capture features related to 839 674 patented compounds.…”
“…Such models can be derived to learn a variety of sequence-to-sequence mappings for diverse design tasks, giving rise to different types of chemical language models (CLMs). 6–14 Popular DNN architectures from NLP include recurrent neural networks (RNNs), which were first adapted for applications in chemistry, 4–7 and different transformer networks that are increasingly used for molecular design and de novo compound generation. 10–15 Different deep generative models including transformers and diffusion models have also been developed in structure-based drug design for the discovery and optimization of new active compounds.…”
Shown is the extension of an analogue series with a new potent compound using a chemical language model. Substitution sites and non-hydrogen R-groups are colored in red (the log-likelihood score for the new analogue is reported in parentheses).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.