2021
DOI: 10.1101/2021.05.05.442755
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

STARsolo: accurate, fast and versatile mapping/quantification of single-cell and single-nucleus RNA-seq data

Abstract: We present STARsolo, a comprehensive turnkey solution for quantifying gene expression in single-cell/nucleus RNA-seq data, built into RNA-seq aligner STAR. Using simulated data that closely resembles realistic scRNA-seq, we demonstrate that STARsolo is highly accurate and significantly outperforms pseudoalignment-to-transcriptome tools. STARsolo can replicate the results of, but is considerably faster than CellRanger, currently the most widely used tool for pre-processing scRNA-seq data. In addition to uniquel… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

6
254
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 258 publications
(260 citation statements)
references
References 49 publications
6
254
0
Order By: Relevance
“…In a third preprint the group from STARsolo performed a benchmark of STARsolo, Alevin, and Kallisto and claimed that STARsolo is more precise and outperforms the pseudo-alignment tools Alevin and Kallisto with simulated data. With a real dataset STARsolo replicated the results from Cell Ranger significantly faster while consuming much less memory [ 20 ].…”
Section: Introductionmentioning
confidence: 99%
“…In a third preprint the group from STARsolo performed a benchmark of STARsolo, Alevin, and Kallisto and claimed that STARsolo is more precise and outperforms the pseudo-alignment tools Alevin and Kallisto with simulated data. With a real dataset STARsolo replicated the results from Cell Ranger significantly faster while consuming much less memory [ 20 ].…”
Section: Introductionmentioning
confidence: 99%
“…Barcode reads were trimmed to exclude the intersub-barcode PCR adapters using a mawk script. Reads were then mapped and cell-demultiplexed using STARsolo 40 in CB_UMI_Complex mode. The resulting STARsolo-filtered count matrices were further analysed using Scanpy 41 .…”
Section: Methodsmentioning
confidence: 99%
“…The targeted exome and the whole genome sequencing reads were aligned to the latest version of the human genome reference (GRCh38, Dec 2013) using BWA v.0.7.17 default settings [20]. The pooled sequencing reads from the scRNA-seq datasets were aligned using the STARsolo module of STAR v.2.7.7a in 2-pass mode, with transcript annotations from the assembly GRCh38.79 [12,21]. To generate individual cell alignments we adopted a publicly available python script that splits the pooled scRNA-seq alignments, based on cellular barcode [22].…”
Section: Data Processing: Alignment Processing Generation Of Individual Scrna-seq Alignmentsmentioning
confidence: 99%
“…For this analysis, we processed the scRNA-seq datasets as we have previously described [11,18]. Briefly, after alignment with STARsolo [12] and quality filtering, the gene-expression matrices were processed using Seurat [25] to normalize gene expression, and corrected for batch-and cell-cycle effects. The normalized gene expression values were then used to assign likely cell types using SingleR and to provide context for cells carrying particular SNVs [26] (Methods).…”
Section: Scesnvs Expressionmentioning
confidence: 99%
See 1 more Smart Citation