2020
DOI: 10.1101/2020.06.19.159749
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

FastPG: Fast clustering of millions of single cells

Abstract: Current single-cell experiments can produce datasets with millions of cells. Unsupervised clustering can be used to identify cell populations in single-cell analysis but often leads to interminable computation time at this scale. This problem has previously been mitigated by subsampling cells, which greatly reduces accuracy. We built on the graph-based algorithm PhenoGraph and developed FastPG which has the same cell assignment accuracy but is on average 27x faster in our tests. FastPG also outperforms two oth… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(15 citation statements)
references
References 14 publications
0
15
0
Order By: Relevance
“…To further dissect the data set, we applied the FlowSOM algorithm ( Van Gassen et al, 2015 ), which could not selectively assign iNKT cells to any cluster or meta-cluster even though they were tightly grouped on the t-SNE map ( Fig S16A and B ). To overcome these problems, we implemented our recently described cytoChain application ( Manfredi et al, 2021 ) with FastPhenoGraph (FastPG) algorithm ( Fig S16C ) ( Bodenheimer et al, 2020 Preprint ), which computed 28 clusters that were superimposed on the t-SNE map, allowing both the visualization of their distribution within the data set ( Fig 8B ) and of their phenotype by a heat-map generated based on the fluorescence intensity associated to each expressed marker ( Fig 8C ). For instance, a specific phenotype signature was found to be enriched among CD8 T cells for clusters 21, 17, 24, 13, 27, and 23 that expressed both tissue residence (CD69-CD103) and activation/exhaustion markers to a variable extent (i.e., CD39, CD95, TIGIT, 2B4, PD1, HLA-DR, ICOS, and GITR).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…To further dissect the data set, we applied the FlowSOM algorithm ( Van Gassen et al, 2015 ), which could not selectively assign iNKT cells to any cluster or meta-cluster even though they were tightly grouped on the t-SNE map ( Fig S16A and B ). To overcome these problems, we implemented our recently described cytoChain application ( Manfredi et al, 2021 ) with FastPhenoGraph (FastPG) algorithm ( Fig S16C ) ( Bodenheimer et al, 2020 Preprint ), which computed 28 clusters that were superimposed on the t-SNE map, allowing both the visualization of their distribution within the data set ( Fig 8B ) and of their phenotype by a heat-map generated based on the fluorescence intensity associated to each expressed marker ( Fig 8C ). For instance, a specific phenotype signature was found to be enriched among CD8 T cells for clusters 21, 17, 24, 13, 27, and 23 that expressed both tissue residence (CD69-CD103) and activation/exhaustion markers to a variable extent (i.e., CD39, CD95, TIGIT, 2B4, PD1, HLA-DR, ICOS, and GITR).…”
Section: Resultsmentioning
confidence: 99%
“…If multiple .FCS files are uploaded they must be merged (concatenated) before the dimensionality reduction process. Clustering on the reported CRC-LM data set was run both with by FlowSOM ( Van Gassen et al, 2015 ) and FastPG ( Bodenheimer et al, 2020 Preprint ) algorithms. In our case, we estimated that FastPG outperformed FlowSOM.…”
Section: Methodsmentioning
confidence: 99%
“…To visualize the integrated data, we performed dimension reduction by PCA and used the top 50 PCs for Uniform Manifold Approximation and Projection (UMAP) ( 21 ). FastPG, a fast phenograph-like clustering method, was used to cluster the cells ( 22 ). Finally, we performed cell type identification of clusters using CELLiD.…”
Section: Methodsmentioning
confidence: 99%
“…Consequently, efficient parallel algorithms can play a significant role in enabling single-cell analysis. The ExaGraph team integrated their scalable community detection method with a newly developed tool named FastPG (Bodenheimer et al, 2020). FastPG builds on the state-of-the-art-method graph-based algorithm PhenoGraph by parallelizing key steps in the workflow, where the final step is clustering using Grappolo .…”
Section: Combinatorial Approaches For Graph Algorithmsmentioning
confidence: 99%
“…Using a set of standard datasets with known ground truth, the team demostrates that FastPG has the same cell assignment accuracy but is on average 27× faster than other tools. FastPG also has higher cell assignment accuracy than two other fast clustering methods, FlowSOM and PARC (Bodenheimer et al, 2020).…”
Section: Combinatorial Approaches For Graph Algorithmsmentioning
confidence: 99%