2021
DOI: 10.1101/2021.01.13.426593
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning interpretable cellular and gene signature embeddings from single-cell transcriptomic data

Abstract: The advent of single-cell RNA sequencing (scRNA-seq) technologies has revolutionized transcriptomic studies. However, integrative analysis of scRNA-seq data remains a challenge largely due to batch effects. We present single-cell Embedded Topic Model (scETM), an unsupervised deep generative model that recapitulates known cell types by inferring the latent cell topic mixtures via a variational autoencoder. scETM is scalable to over 106 cells and enables effective knowledge transfer across datasets. scETM also o… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
33
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 17 publications
(34 citation statements)
references
References 97 publications
1
33
0
Order By: Relevance
“…scETM source codes as well as the benchmarking workflows have been deposited at the GitHub repository 78…”
Section: Data Availabilitymentioning
confidence: 99%
“…scETM source codes as well as the benchmarking workflows have been deposited at the GitHub repository 78…”
Section: Data Availabilitymentioning
confidence: 99%
“…First, the LDVAE decoder is restricted to use only linear activation functions in order to achieve interpretability; thus, LDVAE performs linear dimensionality reduction. scETM performs a similar dimensionality reduction, but further breaks down the loading matrix through tri-factorization 32 . Second, the LDVAE loss function uses a negative binomial or zero-inflated negative binomial distribution over the input features (genes), instead of the Gaussian distribution used in a classic VAE.…”
Section: Results -Sivae Accurately Generates Low Dimensional Embeddings Of Cellsmentioning
confidence: 99%
“…We first proposed a regularized linear decoder to include domain knowledge into autoencoders for single-cell data at a conference 86 , with scalable and expressive embeddings when compared to existing factor models, such as f-scLVM 87 . Recent approaches such as VEGA 88 , scTEM 89 and pmVAE 90 also feature VAE-based architectures with linear decoders or training separate VAEs for each GP yet connected via a global loss in the case of pmVAE. In contrast, expiMap aims toward interpretable reference mapping allowing to fuse reference atlases with GPs and enabling the query of genes or GPs.…”
Section: Discussionmentioning
confidence: 99%