2020
DOI: 10.1101/2020.01.14.906768
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning to Encode Cellular Responses to Systematic Perturbations with Deep Generative Models

Abstract: Components of cellular signaling systems are organized as hierarchical networks, and perturbing different components of the system often leads to transcriptomic profiles that exhibit compositional statistical patterns. Mining such patterns to investigate how cellular signals are encoded is an important problem in systems biology. Here, we investigated the capability of deep generative models (DGMs) for modeling signaling systems and learning representations for transcriptomic profiles derived from cells under … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(8 citation statements)
references
References 29 publications
0
8
0
Order By: Relevance
“…However, the MMD-VAE failed to simulate all cell morphology modes, which the other VAE variants successfully captured. By training these VAE variants on L1000 gene expression readouts resulting from the same set of compound perturbations, we observed large differences in optimal hyperparameters as compared to training on Cell Painting image-based data (Subramanian et al, 2017;Xue et al, 2020). These observations indicate that the KL divergence penalty strongly influences cell morphology modeling ability, and that lessons learned by modeling other biomedical data types, such as gene expression, will not necessarily directly translate to cell morphology (Yang et al, 2021).…”
Section: Discussionmentioning
confidence: 97%
See 1 more Smart Citation
“…However, the MMD-VAE failed to simulate all cell morphology modes, which the other VAE variants successfully captured. By training these VAE variants on L1000 gene expression readouts resulting from the same set of compound perturbations, we observed large differences in optimal hyperparameters as compared to training on Cell Painting image-based data (Subramanian et al, 2017;Xue et al, 2020). These observations indicate that the KL divergence penalty strongly influences cell morphology modeling ability, and that lessons learned by modeling other biomedical data types, such as gene expression, will not necessarily directly translate to cell morphology (Yang et al, 2021).…”
Section: Discussionmentioning
confidence: 97%
“…Recently, VAEs have been successfully trained on various biomedical data modalities such as bulk and single-cell gene expression data (Xue et al, 2020) from different assays measuring cell line perturbations and patient-derived tissue (Lotfollahi et al, 2019;Rampášek et al, 2019;Lopez et al, 2018;Way & Greene, 2018), DNA methylation (Levy et al, 2020), and cell image pixels (Lafarge et al, 2018;Ternes et al, 2021). β-VAEs have been used to produce disentangled latent representations of single cell RNA-seq data (Kimmel, 2020).…”
Section: Introductionmentioning
confidence: 99%
“…To obtain an integrative drug embedding (IDE) reflecting multiple aspects of the drug, we adopted a semi-supervised method 19 that integrated two sources of information: 1) chemical structure of the drug, and 2) functional impact of the drug on gene expression. 17 To represent chemical structures, we used the SMILES 22 representation of molecules and obtained SMILES strings of 250K drug-like molecules from the ZINC 23 database. To represent the functional impact on gene expression, we trained a variational auto-encoder (VAE) 24 model on the transcriptomic data of cell lines treated by different drugs from the LINCS database.…”
Section: Methodsmentioning
confidence: 99%
“…Prior approaches to learning representations of drugs have transformed their chemical structures from a form defined by the simplified molecularinput line-entry system (SMILES) into a vector (an embedding) that can be concatenated with cell embeddings in a deep learning model to predict drug response. [14][15][16] However, this approach does not utilize a rich body of information regarding the functional impact of chemicals on cell signaling systems, 8,9,17 which is highly relevant to the MOAs of drugs 18 and thus relevant to predicting drug responses.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation