2022
DOI: 10.1101/2022.01.31.478592
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ClustAssess: tools for assessing the robustness of single-cell clustering

Abstract: The transition from bulk to single-cell analyses refocused the computational challenges for high-throughput sequencing data-processing. The core of single-cell pipelines is partitioning cells and assigning cell-identities; extensive consequences derive from this step; generating robust and reproducible outputs is essential. From benchmarking established single-cell pipelines, we observed that clustering results critically depend on algorithmic choices (e.g. method, parameters) and technical details (e.g. rando… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2

Relationship

3
6

Authors

Journals

citations
Cited by 13 publications
(13 citation statements)
references
References 74 publications
0
13
0
Order By: Relevance
“…However, Cytocipher was able to identify the transcriptionally distinct populations. Importantly, several methods have been developed concerned with varying cluster hyper-parameters to identify an optimal parameter set [32][33][34][35][36][37][38][39] . While the criteria to define 'optimal' clustering is tool dependent 2-39 , the tool Clustree 39 for example, defines this as the highest resolution clustering where further increasing the clustering resolution results in arbitrary rearrangements of cells to increase the number of clusters.…”
Section: Discussionmentioning
confidence: 99%
“…However, Cytocipher was able to identify the transcriptionally distinct populations. Importantly, several methods have been developed concerned with varying cluster hyper-parameters to identify an optimal parameter set [32][33][34][35][36][37][38][39] . While the criteria to define 'optimal' clustering is tool dependent 2-39 , the tool Clustree 39 for example, defines this as the highest resolution clustering where further increasing the clustering resolution results in arbitrary rearrangements of cells to increase the number of clusters.…”
Section: Discussionmentioning
confidence: 99%
“…A 20-nearest neighbour graph was computed on the first 30 PCs of the data; the cells were clustered using SLM community detection [ 60 ] on the NN graph. To assess the clustering similarity, element-centric clustering comparison [ 61 ] was employed on the set of common barcodes between GT and S using the ClustAssess R package [ 62 ]. Cluster markers were identified using the ROC test in Seurat v3.1.4 [ 56 ]; only genes with were considered.…”
Section: Materials and Methodsmentioning
confidence: 99%
“…Dimensionality reductions [PCA followed by UMAP (83)], as well as clustering [Louvain algorithm (84)], were conducted in Seurat; the optimal number of clusters was selected on the basis of a stability analysis using the ClustAssess package (v0.3.0) (85). Following an assessment of the stability of clustering results, for subsequent steps, we focused on the 3000 most variable genes, across all cells in the dataset.…”
Section: Single-cell Rna Sequencing Analysismentioning
confidence: 99%