2018
DOI: 10.1101/gr.230771.117
|View full text |Cite
|
Sign up to set email alerts
|

bigSCale: an analytical framework for big-scale single-cell data

Abstract: Single-cell RNA sequencing (scRNA-seq) has significantly deepened our insights into complex tissues, with the latest techniques capable of processing tens of thousands of cells simultaneously. Analyzing increasing numbers of cells, however, generates extremely large data sets, extending processing time and challenging computing resources. Current scRNA-seq analysis tools are not designed to interrogate large data sets and often lack sensitivity to identify marker genes. With bigSCale, we provide a scalable ana… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
54
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 73 publications
(55 citation statements)
references
References 45 publications
1
54
0
Order By: Relevance
“…The expression estimates of 112,593 isoforms are provided by the Conquer project (Soneson and Robinson, 2018). • Neuronal progenitor cells (NPCs) also form two groups, one from the patient and the other from a healthy donor (Iacono et al, 2018), 360 cells in each group. The expression estimates of 41,020 genes are provided by the bigSCale project (Iacono et al, 2018).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The expression estimates of 112,593 isoforms are provided by the Conquer project (Soneson and Robinson, 2018). • Neuronal progenitor cells (NPCs) also form two groups, one from the patient and the other from a healthy donor (Iacono et al, 2018), 360 cells in each group. The expression estimates of 41,020 genes are provided by the bigSCale project (Iacono et al, 2018).…”
Section: Resultsmentioning
confidence: 99%
“…The third real data set (NPCs) is a subset of GSE102934 data from the NCBI Gene Expression Omnibus (Iacono et al, 2018). This data set has 720 NPCs derived from induced pluripotent stem (iPS) cells, half of which are from a Williams-Beuren patient and the other half are from a healthy donor.…”
Section: Experimental and Synthetic Data Setsmentioning
confidence: 99%
“…For large programs, e.g., the Human Cell Atlas project [ 148 ], the volume of data demands more robust computer hardware and software. Although a few down-sampling or convolution-based methods have been proposed to manage large-scale scRNA-seq data for clustering and differential expression analysis [ 149 151 ], efficient and effective algorithms are of pressing need to circumvent these difficulties.…”
Section: Overview Of Single-cell Sequencing and Analysismentioning
confidence: 99%
“…The second category of methods focuses on modeling GRNs within-cell populations without considering cell trajectories or dynamics. These methods include coexpression and TF-based [88,94,95], coexpression and TF-independent [89,90,109], and information theory [91] (Table 1 and Figure 1B). This is in line with the basic concepts underlying GRN modeling of gene-gene interactions for a tissue, except here single-cell data are modeled for specific cell populations.…”
Section: Within-cell Population Networkmentioning
confidence: 99%
“…The former calculates pairwise Spearman rank correlations between all sets of genes across cells within a cell type to infer cell-type GRNs in hematopoiesis, and significant pairwise associations were identified using the odds ratio of linearly transformed expression data. Iacono et al [109] used a Pearson correlation-based method which first transforms the expression values using bigSCale to derive a z-score for each gene using a probabilistic model to account for noise and variability inherent to single-cell data. Pairwise correlations of z-scores are used to construct GRNs.…”
Section: Within-cell Population Networkmentioning
confidence: 99%