Predicting drug polypharmacology from cell morphology readouts using variational autoencoder latent space arithmetic

Chow, Yuen Ler; Singh, Shantanu; Carpenter, Anne E.; Way, Gregory P.

doi:10.1101/2021.09.02.458673

Cited by 2 publications

(2 citation statements)

References 38 publications

“…Future studies would benefit from larger datasets, such as the announced future data depositions from the JUMP consortium [80], and also more and better annotated compounds that show mitochondrial toxicity under different assays and dosages such as from the Mitotox database [46]. It may also be possible to apply different types of machine learning or deep learning models, such as deep neural networks, gradient boosting, or a variational autoencoder (which has been previously shown to reveal an interpretable latent space [81]) to improve the model’s predictions and generally improve the interpretability of models.…”

Section: Results (mentioning)

confidence: 99%

Integrating Cell Morphology with Gene Expression and Chemical Structure to Aid Mitochondrial Toxicity Detection

Seal, Carreras-Puigvert, Trapotsi, et al. 2022

Preprint

Mitochondrial toxicity is an important safety endpoint in drug discovery. Models based solely on chemical structure for predicting mitochondrial toxicity are currently limited in accuracy and applicability domain to the chemical space of the training compounds. In this work, we aimed to utilize both -omics and chemical data to push beyond the state-of-the-art. We combined Cell Painting and Gene Expression data with chemical structural information from Morgan fingerprints for 382 chemical perturbants tested in the Tox21 mitochondrial membrane depolarization assay. We observed that mitochondrial toxicants significantly differ from non-toxic compounds in morphological space and identified compound clusters having similar mechanisms of mitochondrial toxicity, thereby indicating that morphological space provides biological insights related to mechanisms of action of this endpoint. We further showed that models combining Cell Painting, Gene Expression features and Morgan fingerprints improved model performance on an external test set of 236 compounds by 60% (in terms of F1 score) and improved extrapolation to new chemical space. The performance of our combined models was comparable with dedicated in vitro assays for mitochondrial toxicity, and they were able to detect mitochondrial toxicity where Tox21 assay outcomes were inconclusive because of cytotoxicity. Our results suggest that combining chemical descriptors with different levels of biological readouts enhances the detection of mitochondrial toxicants, with practical implications for use in drug discovery.
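The abstract above reports its improvement in terms of F1 score. As a reminder of what that metric measures, here is a minimal sketch in plain Python. The function name `f1_score_binary` and the toy labels are illustrative assumptions, not taken from the cited preprint, and this is not the authors' evaluation code.

```python
def f1_score_binary(y_true, y_pred):
    """F1 score for binary labels (1 = toxic, 0 = non-toxic).

    Hypothetical helper for illustration only.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    if tp == 0:
        # No correct positive predictions: define F1 as 0 to avoid division by zero.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    # Harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)


# Toy labels (made up): 3 toxic and 3 non-toxic compounds.
y_true = [1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0]
score = f1_score_binary(y_true, y_pred)
```

Because F1 is the harmonic mean of precision and recall, it rewards models that find toxic compounds without flagging many non-toxic ones, which matters when the toxic class is the minority.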

“…As shown in Figure 1, small molecules enter into cells and affect their biological functions and pathways, leading to morphological changes in cell shape, number, structure, etc., that are visible in microscopy images after staining. Analysis and modeling based on these high-content images have shown great success in molecular bioactivity prediction [20], mechanism identification [21], polypharmacology prediction [22], etc. The stained cell images contain rich morphological information that reflects the biological changes induced by chemical structures on cell cultures.…”

Section: Introduction (mentioning)

confidence: 99%

Cross-modal Graph Contrastive Learning with Cellular Images

Zheng, Rao, Zhang, et al. 2022

Preprint

Constructing discriminative representations of molecules lies at the core of a number of domains such as drug discovery, material science, and chemistry. State-of-the-art methods employ graph neural networks (GNNs) and self-supervised learning (SSL) to learn the structural representations from unlabeled data, which can then be fine-tuned for downstream tasks. Albeit powerful, these methods that are pre-trained solely on molecular structures cannot generalize well to the tasks involved in intricate biological processes. To cope with this challenge, we propose using high-content cell microscopy images to assist in learning molecular representation. The fundamental rationale of our method is to leverage the correspondence between molecular topological structures and the caused perturbations at the phenotypic level. By including cross-modal pre-training with different types of contrastive loss functions in a unified framework, our model can efficiently learn generic and informative representations from cellular images, which are complementary to molecular structures. Empirical experiments demonstrated that the model transfers non-trivially to a variety of downstream tasks and is often competitive with the existing SSL baselines, e.g., a 15.4% absolute Hit@10 gain in the graph-image retrieval task and a 4.0% absolute AUC improvement in clinical outcome prediction. Further zero-shot case studies show the potential of the approach to be applied to real-world drug discovery.
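The abstract above reports a retrieval result as Hit@10. A minimal sketch of how a Hit@k metric is commonly computed follows, assuming a square similarity matrix in which query i (for example, a molecular graph) has its true match at candidate index i (for example, the paired image). The function name `hit_at_k` and the toy scores are illustrative assumptions, not the cited preprint's code or data.

```python
def hit_at_k(sim, k):
    """Fraction of queries whose true match ranks in the top k candidates.

    sim[i][j] is the similarity between query i and candidate j;
    the true match for query i is assumed to be candidate i.
    Hypothetical helper for illustration only.
    """
    hits = 0
    for i, row in enumerate(sim):
        # Candidate indices sorted from most to least similar.
        ranked = sorted(range(len(row)), key=lambda j: row[j], reverse=True)
        if i in ranked[:k]:
            hits += 1
    return hits / len(sim)


# Toy similarity scores (made up): 3 queries against 3 candidates.
sim = [
    [0.9, 0.1, 0.2],  # true match (index 0) ranked first
    [0.8, 0.6, 0.5],  # true match (index 1) ranked second
    [0.1, 0.2, 0.7],  # true match (index 2) ranked first
]
hit1 = hit_at_k(sim, 1)
hit2 = hit_at_k(sim, 2)
```

An "absolute" gain in Hit@10, as quoted in the abstract, refers to a difference in this fraction expressed in percentage points.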
