2020
DOI: 10.1093/nargab/lqaa039
|View full text |Cite
|
Sign up to set email alerts
|

Deep soft K-means clustering with self-training for single-cell RNA sequence data

Abstract: Single-cell RNA sequencing (scRNA-seq) allows researchers to study cell heterogeneity at the cellular level. A crucial step in analyzing scRNA-seq data is to cluster cells into subpopulations to facilitate subsequent downstream analysis. However, frequent dropout events and increasing size of scRNA-seq data make clustering such high-dimensional, sparse and massive transcriptional expression profiles challenging. Although some existing deep learning-based clustering algorithms for single cells combine dimension… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
84
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 83 publications
(84 citation statements)
references
References 50 publications
0
84
0
Order By: Relevance
“…Furthermore, they are actually located on an inenarrable low-dimensional manifold. Therefore, we use the deep autoencoder representation to approximate this parameter space and estimate three groups of parameters by three output layers in a manner similar to that of the DCA and scziDesk model [ 7 , 28 ]. To take the batch effects into account, we merge the expression data matrix x with batch matrix b as the input of the encoder network.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…Furthermore, they are actually located on an inenarrable low-dimensional manifold. Therefore, we use the deep autoencoder representation to approximate this parameter space and estimate three groups of parameters by three output layers in a manner similar to that of the DCA and scziDesk model [ 7 , 28 ]. To take the batch effects into account, we merge the expression data matrix x with batch matrix b as the input of the encoder network.…”
Section: Methodsmentioning
confidence: 99%
“…For example, exploring cell-to-cell interactions and gene-to-gene interactions are often based on specific cell types [4,5]. In the past few years, a large volume of single-cell transcriptome unsupervised clustering algorithms have emerged based on the similarity of gene expression patterns [6][7][8][9]. However, in the absence of a unified standard, the clustering results of different algorithms usually show a little degree of overlap and even vary widely [10].…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…Semi-supervised learning, a data incremental learning technology, is invented to obtain more labelled data by labeled unlabelled samples . In addition, self-training, a famous semi-supervised method, can implement high-confidance and low loss learning mechanism to improve performance of the model [6]. Moreover, common classifiers met tumor data do not achieve satisfac-tory performance since characteristics of the high-dimension and small sample size lead to over-fiiting [7].…”
Section: Introductionmentioning
confidence: 99%