2017
DOI: 10.1101/237065
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Exploring Single-Cell Data with Deep Multitasking Neural Networks

Abstract: Handling the vast amounts of single-cell RNA-sequencing and CyTOF data, which are now being generated in patient cohorts, presents a computational challenge due to the noise, complexity, sparsity and batch effects present. Here, we propose a unified deep neural network-based approach to automatically process and extract structure from these massive datasets. Our unsupervised architecture, called SAUCIE (Sparse Autoencoder for Unsupervised Clustering, Imputation, and Embedding), simultaneously performs several … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
93
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 78 publications
(93 citation statements)
references
References 50 publications
0
93
0
Order By: Relevance
“…As the field matures, common analytical methods such as t-SNE have risen as consensus algorithms for non-linear single-cell data visualization, while new methods are being introduced to address data challenges that still limit biological interpretation. For example, new computational methods are being developed to address “drop-out noise” in single-cell RNA-seq data [49], to identify underlying data structures that are continuous and non-linear [50], and to speed up computation time [51]. Other challenges still to be addressed include integrating across different types of data sets: incorporating single-cell data with measurements collected in cell populations; and incorporating prior biological knowledge, such as time series and dose responses, to extract more biological insight from the same data set.…”
Section: Resultsmentioning
confidence: 99%
“…As the field matures, common analytical methods such as t-SNE have risen as consensus algorithms for non-linear single-cell data visualization, while new methods are being introduced to address data challenges that still limit biological interpretation. For example, new computational methods are being developed to address “drop-out noise” in single-cell RNA-seq data [49], to identify underlying data structures that are continuous and non-linear [50], and to speed up computation time [51]. Other challenges still to be addressed include integrating across different types of data sets: incorporating single-cell data with measurements collected in cell populations; and incorporating prior biological knowledge, such as time series and dose responses, to extract more biological insight from the same data set.…”
Section: Resultsmentioning
confidence: 99%
“…Notably, Amodio et al (2017), concurrently with this work, introduced an autoencoder-based approach for jointly performing batch effect correction and visualization, which finds a parametric embedding like net-SNE. Because they optimize a different objective function, however, the behavior of their embedding is fundamentally different from that of t-SNE (and thus net-SNE).…”
Section: Discussionmentioning
confidence: 99%
“…Applications of DGMs to scRNA‐seq data emerged as a useful way to embed and analyze cells in a low‐dimensional space that summarizes their transcriptomes. Here, the distances between cells in the embedding space can be used to identify phenotypically coherent groups of cells, reflecting either discrete cell types (e.g., T cells, B cells), hierarchies of types (e.g., subtypes of T cells), or variation along some continuum (e.g., progression along the cell cycle) (Ding et al , ; Lopez et al , 2018a; Wang & Gu, ; Amodio et al , ; Eraslan et al , 2019b; Rashid et al , ; Grønbech et al , ). For example, scvis (Ding et al , ) employs a VAE to learn a biologically meaningful two‐dimensional representation of single cells from oligodendroglioma samples.…”
Section: Applications To Molecular Biology and Biomedical Researchmentioning
confidence: 99%