2020
DOI: 10.1093/bioinformatics/btaa908
|View full text |Cite
|
Sign up to set email alerts
|

Single-cell RNA-seq data semi-supervised clustering and annotation via structural regularized domain adaptation

Abstract: Motivation The rapid development of single-cell RNA sequencing (scRNA-seq) technologies allows us to explore tissue heterogeneity at the cellular level. The identification of cell types plays an essential role in the analysis of scRNA-seq data, which, in turn, influences the discovery of regulatory genes that induce heterogeneity. As the scale of sequencing data increases, the classical method of combining clustering and differential expression analysis to annotate cells becomes more costly i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2021
2021
2025
2025

Publication Types

Select...
7
1
1

Relationship

2
7

Authors

Journals

citations
Cited by 33 publications
(27 citation statements)
references
References 34 publications
0
21
0
Order By: Relevance
“…First, we transform and to low-dimensional representations and with omics-specific encoder networks f ( r ) and f ( p ) separately. To capture the character of transcriptomic (scRNA-seq) data, we utilize the same ZINB (zero-inflated negative binomial) ( Huang et al, 2018 ) model-based autoencoder as in scSemiCluster ( Chen et al, 2021 ). Protein data are not as sparse as scRNA-seq data; therefore, we empirically use a negative binomial (NB) model, a ZINB model hybrid, to characterize it.…”
Section: Methodsmentioning
confidence: 99%
“…First, we transform and to low-dimensional representations and with omics-specific encoder networks f ( r ) and f ( p ) separately. To capture the character of transcriptomic (scRNA-seq) data, we utilize the same ZINB (zero-inflated negative binomial) ( Huang et al, 2018 ) model-based autoencoder as in scSemiCluster ( Chen et al, 2021 ). Protein data are not as sparse as scRNA-seq data; therefore, we empirically use a negative binomial (NB) model, a ZINB model hybrid, to characterize it.…”
Section: Methodsmentioning
confidence: 99%
“…In order to enable the embedding component E ij to fully capture the characteristics of cells and make the classification model better fit the query dataset, we employ the embedding component as an encoder and use a mirror image of the embedding component as a decoder to construct an autoencoder [16, 17]. The reconstruction loss of cells both from the reference subset and the query subset is taken into consideration when training the classification model.…”
Section: Methodsmentioning
confidence: 99%
“…Both methods use variational inference and deep generative models to fully characterize the distribution of single-cell RNA-seq data. scANVI can also be used to annotate cell types and has been used as a baseline method for recently proposed methods, such as scSemiCluster (Chen et al, 2021) and scNym (Kimmel and Kelley, 2021). scReClassify proposed by Kim et al (2019) uses PCA to perform dimension reduction of the original single cell RNAseq data, and then apply a semi-supervised learning method to reclassify the mislabeled cell types caused by human inspection.…”
Section: Applications Of Semi-supervised Learning For Single-cell Rna...mentioning
confidence: 99%