Benoît Baillif scite author profile

Finding new molecules with a desired biological activity is an extremely difficult task. In this context, artificial intelligence and generative models have been used for molecular de novo design and compound optimization. Herein, we report a generative model that bridges systems biology and molecular design, conditioning a generative adversarial network with transcriptomic data. By doing so, we can automatically design molecules that have a high probability to induce a desired transcriptomic profile. As long as the gene expression signature of the desired state is provided, this model is able to design active-like molecules for desired targets without any previous target annotation of the training compounds. Molecules designed by this model are more similar to active compounds than the ones identified by similarity of gene expression signatures. Overall, this method represents an alternative approach to bridge chemistry and biology in the long and difficult road of drug discovery.

show abstract

De Novo Generation of Hit-like Molecules from Gene Expression Signatures Using Artificial Intelligence

Méndez‐Lucio¹,

Baillif²,

Clevert³

et al. 2018

Preprint

View full text Add to dashboard Cite

Finding new molecules with a desired biological activity is an extremely difficult task. In this context, artificial intelligence and generative models have been used for molecular <i>de novo</i> design and compound optimization. Herein, we report the first generative model that bridges systems biology and molecular design conditioning a generative adversarial network with transcriptomic data. By doing this we could generate molecules that have high probability to produce a desired biological effect at cellular level. We show that this model is able to design active-like molecules for desired targets without any previous target annotation of the training compounds as long as the gene expression signature of the desired state is provided. The molecules generated by this model are more similar to active compounds than the ones identified by similarity of gene expression signatures, which is the state-of-the-art method for navigating compound-induced gene expression data. Overall, this method represents a novel way to bridge chemistry and biology to advance in the long and difficult road of drug discovery.

show abstract

De Novo Generation of Hit-like Molecules from Gene Expression Signatures Using Artificial Intelligence

Méndez‐Lucio

Baillif²,

Clevert³

et al. 2018

Preprint

View full text Add to dashboard Cite

Finding new molecules with a desired biological activity is an extremely difficult task. In this context, artificial intelligence and generative models have been used for molecular <i>de novo</i> design and compound optimization. Herein, we report the first generative model that bridges systems biology and molecular design conditioning a generative adversarial network with transcriptomic data. By doing this we could generate molecules that have high probability to produce a desired biological effect at cellular level. We show that this model is able to design active-like molecules for desired targets without any previous target annotation of the training compounds as long as the gene expression signature of the desired state is provided. The molecules generated by this model are more similar to active compounds than the ones identified by similarity of gene expression signatures, which is the state-of-the-art method for navigating compound-induced gene expression data. Overall, this method represents a novel way to bridge chemistry and biology to advance in the long and difficult road of drug discovery.

show abstract

Exploring the Use of Compound-Induced Transcriptomic Data Generated From Cell Lines to Predict Compound Activity Toward Molecular Targets

et al. 2020

View full text Add to dashboard Cite

Pharmaceutical or phytopharmaceutical molecules rely on the interaction with one or more specific molecular targets to induce their anticipated biological responses. Nonetheless, these compounds are also prone to interact with many other non-intended biological targets, also known as off-targets. Unfortunately, off-target identification is difficult and expensive. Consequently, QSAR models predicting the activity on a target have gained importance in drug discovery or in the de-risking of chemicals. However, a restricted number of targets are well characterized and hold enough data to build such in silico models. A good alternative to individual target evaluations is to use integrative evaluations such as transcriptomics obtained from compound-induced gene expression measurements derived from cell cultures. The advantage of these particular experiments is to capture the consequences of the interaction of compounds on many possible molecular targets and biological pathways, without having any constraints concerning the chemical space. In this work, we assessed the value of a large public dataset of compound-induced transcriptomic data, to predict compound activity on a selection of 69 molecular targets. We compared such descriptors with other QSAR descriptors, namely the Morgan fingerprints (similar to extended-connectivity fingerprints). Depending on the target, active compounds could show similar signatures in one or multiple cell lines, whether these active compounds shared similar or different chemical structures. Random forest models using gene expression signatures were able to perform similarly or better than counterpart models built with Morgan fingerprints for 25% of the target prediction tasks. These performances occurred mostly using signatures produced in cell lines showing similar signatures for active compounds toward the considered target. We show that compound-induced transcriptomic data could represent a great opportunity for target prediction, allowing to overcome the chemical space limitation of QSAR models.

show abstract

Deep generative models for 3D molecular structure

Baillif

Cole

McCabe

et al. 2023

Current Opinion in Structural Biology

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Benoît Baillif

De novo generation of hit-like molecules from gene expression signatures using artificial intelligence

De Novo Generation of Hit-like Molecules from Gene Expression Signatures Using Artificial Intelligence

De Novo Generation of Hit-like Molecules from Gene Expression Signatures Using Artificial Intelligence

Exploring the Use of Compound-Induced Transcriptomic Data Generated From Cell Lines to Predict Compound Activity Toward Molecular Targets

Deep generative models for 3D molecular structure

Contact Info

Product

Resources

About