2021
DOI: 10.1016/j.cels.2020.11.008
|View full text |Cite
|
Sign up to set email alerts
|

Benchmarking Computational Doublet-Detection Methods for Single-Cell RNA Sequencing Data

Abstract: In single-cell RNA sequencing (scRNA-seq), doublets form when two cells are encapsulated into one reaction volume by chance. The existence of doublets, which appear to be-but are not-real cells, is a key confounder in scRNA-seq data analysis. Computational methods have been developed to detect doublets in scRNA-seq data; however, the scRNA-seq field lacks a comprehensive benchmarking of these methods, making it difficult for researchers to choose an appropriate method for their specific analysis needs. Here, w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
141
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 136 publications
(141 citation statements)
references
References 74 publications
0
141
0
Order By: Relevance
“…Surprisingly, DoubletDecon 28 has estimated an exceptionally high doublet rate with only 0.01% of the detected doublets is in agreement with the conserved doublets. This result could be explained by the benchmark experiment against DoubletFinder, Scrublet and solo; DoubletDecon tends to have higher sensitivity and lower specificity 28 , 29 . Therefore, we considered this inconsistent result as an outlier.…”
Section: Discussionmentioning
confidence: 99%
“…Surprisingly, DoubletDecon 28 has estimated an exceptionally high doublet rate with only 0.01% of the detected doublets is in agreement with the conserved doublets. This result could be explained by the benchmark experiment against DoubletFinder, Scrublet and solo; DoubletDecon tends to have higher sensitivity and lower specificity 28 , 29 . Therefore, we considered this inconsistent result as an outlier.…”
Section: Discussionmentioning
confidence: 99%
“…However, the pace of computational method development for this assay lags behind the pace of data generation. An open computational problem is the detection of multiplets (i.e., two or more cells/nuclei captured and profiled together)—a common challenge for droplet-based single-cell technologies [ 5 ]. The presence of multiplets confounds downstream analyses, such as cell clustering, annotation, differential accessibility, and allelic accessibility analyses, by introducing combined epigenomic profiles that originate from two or more nuclei.…”
Section: Introductionmentioning
confidence: 99%
“…In fact, all model-based simulators that learn a generative model from real data must ignore certain outlier cells that do not fit well to their model. Some outlier cells could either represent an extremely rare cell type or are just “doublets” [ 100 103 ], artifacts resulted from single-cell sequencing experiments. Hence, our stance is that ignorance of outlier cells is a sacrifice that every simulator has to make; the open question is the degree to which outlier cells should be ignored, and proper answers to this question must resort to statistical model selection principles.…”
Section: Discussionmentioning
confidence: 99%