2016
DOI: 10.1101/036863
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Fast and accurate single-cell RNA-Seq analysis by clustering of transcript-compatibility counts

Abstract: Current approaches to single-cell transcriptomic analysis are computationally intensive and require assay-specific modeling, which limits their scope and generality. We propose a novel method that compares and clusters cells based on their transcript-compatibility read counts rather than on the transcript or gene quantifications used in standard analysis pipelines. In the reanalysis of two landmark yet disparate single-cell RNA-seq datasets, we show that our method is up to two orders of magnitude faster than … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
33
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
5
4

Relationship

3
6

Authors

Journals

citations
Cited by 22 publications
(33 citation statements)
references
References 54 publications
0
33
0
Order By: Relevance
“…As previously proposed in [11], the equivalence class [15,[19][20][21][22][23][24] that gives rise to a fragment encodes some positional information, by means of encoding the set of transcripts to which the fragment is mapped. This can be used to avoid over-collapsing UMI tags that are likely to result from different molecules by considering UMIs as distinct per equivalence class.…”
Section: Mapping Reads and Umi De-duplicationmentioning
confidence: 99%
See 1 more Smart Citation
“…As previously proposed in [11], the equivalence class [15,[19][20][21][22][23][24] that gives rise to a fragment encodes some positional information, by means of encoding the set of transcripts to which the fragment is mapped. This can be used to avoid over-collapsing UMI tags that are likely to result from different molecules by considering UMIs as distinct per equivalence class.…”
Section: Mapping Reads and Umi De-duplicationmentioning
confidence: 99%
“…Second, and more importantly, the output of the pipelines is not of the same basic form. While alevin, by default, produces a gene-level count matrix, the kb-pipe method produces transcript compatibility counts [11] (i.e. equivalence class counts) as output.…”
Section: Comparing Alevin To Existing Approachesmentioning
confidence: 99%
“…formance is nontrivial. While there has been some work examining the dependence of principal components analysis on read depth [4,5], the methods used do not extend quantitatively to other multivariate analyses.…”
Section: Introductionmentioning
confidence: 99%
“…Modern advances in single-cell technologies can cheaply generate genomic profiles of millions of individual cells [1,2]. Depending on the type of assay, these profiles can describe cell features such as RNA expression, transcript compatability counts [3], epigenetic features [4], or nuclear RNA expression [5]. Because the cell types of individual cells often cannot be known prior to the computational step, a key step in single-cell computational pipelines [6,7,8,9,10] is clustering: organizing individual cells into biologically meaningful populations.…”
mentioning
confidence: 99%