2020
DOI: 10.1101/2020.04.27.064816
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Tuning parameters of dimensionality reduction methods for single-cell RNA-seq analysis

Abstract: Many computational methods have been developed recently to analyze single-cell RNA-seq (scRNA-seq) data. Several benchmark studies have compared these methods on their ability for dimensionality reduction, clustering or differential analysis, often relying on default parameters. Yet given the biological diversity of scRNA-seq datasets, parameter tuning might be essential for the optimal usage of methods, and determining how to tune parameters remains an unmet need. Here, we propose a benchmark to assess the pe… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(14 citation statements)
references
References 29 publications
0
14
0
Order By: Relevance
“…This assumption is however challenged by recent studies which highlight significant differences in gene expression patterns 11,16 , mostly resulting from the lack of micro-environment in cell lines ( Figure 1 A). To assess this cell-type drift and evaluate the ability of existing batch-effect correction tools to correct it, we employed three state-of-the-art approaches 33,34 : Seurat v3 29 , Harmony 35 and LIGER 32 (Methods). For tumors, we use a panel of 208,506 cells from 58 NSCLC cancer patients at different disease stages 20 , including 36,467 epithelial cells ( Extended Figure 1 A-B), referred to as the Kim dataset .…”
Section: Resultsmentioning
confidence: 99%
“…This assumption is however challenged by recent studies which highlight significant differences in gene expression patterns 11,16 , mostly resulting from the lack of micro-environment in cell lines ( Figure 1 A). To assess this cell-type drift and evaluate the ability of existing batch-effect correction tools to correct it, we employed three state-of-the-art approaches 33,34 : Seurat v3 29 , Harmony 35 and LIGER 32 (Methods). For tumors, we use a panel of 208,506 cells from 58 NSCLC cancer patients at different disease stages 20 , including 36,467 epithelial cells ( Extended Figure 1 A-B), referred to as the Kim dataset .…”
Section: Resultsmentioning
confidence: 99%
“…We removed cells three median absolute deviations (MAD) under the mean in counts and expressed genes, as well as those three MAD above the mean in percentage of mitochondrial reads. The code and data used in this manuscript, as well as the 5,000 pre-computed embeddings provided as a benchmark to develop new heuristics for parameter selection, are freely available under an Apache License 2.0 [30,31].…”
Section: Methodsmentioning
confidence: 99%
“…Approaches like scVI can generally yield an improvement over shallow approaches such as PCA (Raimundo et al 2020). Although there is not always a benefit in using deep models over shallow approaches Bellot et al (2018), in theory, deep approaches should always result in a better performance given enough data being available.…”
Section: Training Data and Hyperparameter Optimizationmentioning
confidence: 99%