2019
DOI: 10.1101/776948
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

scClassify: hierarchical classification of cells

Abstract: 17Cell type identification is a key computational challenge in single-cell RNA-sequencing 18 (scRNA-seq) data. To capitalize on the large collections of well-annotated scRNA-seq datasets, 19 we present scClassify, a hierarchical classification framework based on ensemble learning. 20 scClassify can identify cells from published scRNA-seq datasets more accurately and more 21 finely than in the original publications. We also estimate the cell number needed for accurate 22 classification anywhere in a cell type h… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(11 citation statements)
references
References 49 publications
(59 reference statements)
0
11
0
Order By: Relevance
“… For N = 1, P ( S 1 ) is equal to the Hill equation, where k i represents the concentration of H i at half-occupation and n i represents the Hill coefficient. Typically, n i is between [1,10] The Hill equation can be simplified by letting . Since P ( S 0 ) = 1 − P ( S 1 ), the activation function is formulated and simplified as follows.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“… For N = 1, P ( S 1 ) is equal to the Hill equation, where k i represents the concentration of H i at half-occupation and n i represents the Hill coefficient. Typically, n i is between [1,10] The Hill equation can be simplified by letting . Since P ( S 0 ) = 1 − P ( S 1 ), the activation function is formulated and simplified as follows.…”
Section: Methodsmentioning
confidence: 99%
“…Other generators of scRNA-seq data (e.g. splatter [1], powsimR [4], PROSSTT [5] and SymSim [6]) have already been used extensively to explore the strengths and weaknesses of computational tools, both by method developers [7, 8, 9, 10] and independent benchmarkers [11, 12, 13]. However, a limitation of these existing simulators is that they would require significant methodological alterations to add additional modalities or experimental conditions (Table S1).…”
Section: Main Textmentioning
confidence: 99%
“…writing emails to the authors). Cell type annotations are difficult to find and often of mediocre quality, presenting a challenge in particular to algorithms such as scClassify that rely on entire cell type hierarchies 17 . Although northstar is less affected because it does not require a hierarchical ontology, we aspire to change this trend by providing a website with averages and subsamples for several atlases that can be accessed both by humans and programmatically: https ://north stara tlas.githu b.io/atlas _landm arks.…”
Section: Discussionmentioning
confidence: 99%
“…In practice, these methods define marker genes for known cell types and build classifiers to assign new cells to these cell types. In particular, Garnett [29] allows a hierarchical clustering structure, but one that needs to be predefined, and scClassify [30] uses the HOPACH [31] algorithm to establish a hierarchy in the training dataset. Most of these algorithms can also identify new cell types not present in the reference.…”
Section: Discussionmentioning
confidence: 99%