2021
DOI: 10.1093/bioinformatics/btab641
|View full text |Cite
|
Sign up to set email alerts
|

PsiNorm: a scalable normalization for single-cell RNA-seq data

Abstract: Motivation Single-cell RNA sequencing (scRNA-seq) enables transcriptome-wide gene expression measurements at single-cell resolution providing a comprehensive view of the compositions and dynamics of tissue and organism development. The evolution of scRNA-seq protocols has led to a dramatic increase of cells throughput, exacerbating many of the computational and statistical issues that previously arose for bulk sequencing. In particular, with scRNA-seq data all the analyses steps, including no… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
18
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(18 citation statements)
references
References 34 publications
0
18
0
Order By: Relevance
“…These problems have significant implications for common tasks such as finding marker genes, as discussed above. Newer methods that explicitly couple statistical methods with software engineering considerations are needed; we examined several recent publications proposing new ideas but restricted the paper to widely used methods common in existing workflows (Brown et al 2021; Breda, Zavolan, and van Nimwegen 2021; Borella et al 2021; Bacher et al 2017). A detailed analysis and review of these methods is an important next step.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…These problems have significant implications for common tasks such as finding marker genes, as discussed above. Newer methods that explicitly couple statistical methods with software engineering considerations are needed; we examined several recent publications proposing new ideas but restricted the paper to widely used methods common in existing workflows (Brown et al 2021; Breda, Zavolan, and van Nimwegen 2021; Borella et al 2021; Bacher et al 2017). A detailed analysis and review of these methods is an important next step.…”
Section: Discussionmentioning
confidence: 99%
“…Despite the complexity of normalization in practice, much work on scRNAseq has focused on statistical details that, while important, are not necessarily the primary determinants of results. For example, the debate over whether gene-specific over-dispersion parameters should be used when computing Pearson residuals (Hafemeister and Satija 2019; Lause, Berens, and Kobak 2021; Hafemeister and Satija 2020; Choudhary and Satija 2022) ignores the fact that Pearson residuals are not the result of a monotonic transformation, and they create dense matrices that can lead to significant analysis limitations (Borella et al 2021). These problems have significant implications for common tasks such as finding marker genes, as discussed above.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…We used a commonly applied normalization protocol depth scaling or count per million (CPM) normalization and subsequently log1p transformation. This protocol normalizes count data by a factor proportional to the count depth per cell [ 19 ]. While normalization is used to remove count sampling effects, the resulting dataset still contains unwanted variability due to dropout events, the expression effects of cell cycles, and technical inconsistencies such as batch differences.…”
Section: Methodsmentioning
confidence: 99%
“…Therefore, the use of an appropriate data normalization method can minimize such a kind of errors and allow making a comparison between the results of gene expression obtained in different experiments. Using the assumption about the power-law distribution of gene expression in the works [14,15] the appropriate methods of normalization were developed.…”
Section: Introductionmentioning
confidence: 99%