2020
DOI: 10.1038/s41598-020-71805-1
|View full text |Cite
|
Sign up to set email alerts
|

Northstar enables automatic classification of known and novel cell types from tumor samples

Abstract: Single cell transcriptomics is revolutionising our understanding of tissue and disease heterogeneity, yet cell type identification remains a partially manual task. Published algorithms for automatic cell annotation are limited to known cell types and fail to capture novel populations, especially cancer cells. We developed northstar, a computational approach to classify thousands of cells based on published data within seconds while simultaneously identifying and highlighting new cell states such as malignancie… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 16 publications
(9 citation statements)
references
References 40 publications
0
9
0
Order By: Relevance
“…Aligned sequencing data was displayed in BigWig format, and read counts covering enhancers (Table S2) extracted using deepTools pyBigWig 54 Palantir data were subsampled to 40 cells/type. northstar's subsample method 55 was used to infer cell states within ME-1 guided by Palantir data 6 . For graph construction, 10 external (non-mutual) neighbours were allowed to compensate for the fact that ME-1 cells are quite distant from actual hematopoietic cells.…”
Section: Ngs Data Generation/processingmentioning
confidence: 99%
See 2 more Smart Citations
“…Aligned sequencing data was displayed in BigWig format, and read counts covering enhancers (Table S2) extracted using deepTools pyBigWig 54 Palantir data were subsampled to 40 cells/type. northstar's subsample method 55 was used to infer cell states within ME-1 guided by Palantir data 6 . For graph construction, 10 external (non-mutual) neighbours were allowed to compensate for the fact that ME-1 cells are quite distant from actual hematopoietic cells.…”
Section: Ngs Data Generation/processingmentioning
confidence: 99%
“…We reasoned that a more sophisticated feature selection together with soft guidance from healthy marrow data could reveal additional hidden heterogeneity. We therefore switched from unsupervised clustering to northstar, a semi-supervised clustering algorithm that leverages information from training data to channel the axes of heterogeneity during feature selection, graph construction, and cell community detection 55 . Using healthy marrow transcriptomes 6 (Figure 1A) as training data, this analysis revealed two major subpopulations, HSC-like (pink) and Mono-precursor-like (purple, 1136 and 277 out of 1489 cells respectively) plus a minor population that was more similar to Ery-precursor cells (lime, 47 out of 1489 cells) and two small groups of cells resembling Megakaryocytes (18 cells) and Monocytes (11 cells, Figure 4D).…”
Section: Single Cell Transcriptomics Reveal Key Regulators Of the Hsc...mentioning
confidence: 99%
See 1 more Smart Citation
“…Previous single cell batch correction benchmarking studies evaluated algorithm performance on simulated data or datasets derived from healthy tissues and peripheral blood mononuclear cells [4][5][6] . Despite an abundance of data, no single cell batch-effect correction methods are designed for or benchmarked on single cell datasets containing malignant cells 7 . Due to the inherent biological complexity both within and between tumours, these samples present unique technical challenges for batch correction that are not represented in previous benchmarking efforts.…”
Section: Introductionmentioning
confidence: 99%
“…In tissues with cell types characterized by high-quality reference datasets (Regev 2017, Quake 2021, Allen Institute for Brain Science 2011), cell typing can employ supervised classification, avoiding the instability and interpretation difficulties of clustering. A hybrid approach, semi-supervised cell typing, detects known cell types while also discovering new clusters (Zanini 2020).…”
Section: Introductionmentioning
confidence: 99%