2022
DOI: 10.1002/wics.1575
|View full text |Cite
|
Sign up to set email alerts
|

Stability estimation for unsupervised clustering: A review

Abstract: Cluster analysis remains one of the most challenging yet fundamental tasks in unsupervised learning. This is due in part to the fact that there are no labels or gold standards by which performance can be measured. Moreover, the wide range of clustering methods available is governed by different objective functions, different parameters, and dissimilarity measures. The purpose of clustering is versatile, often playing critical roles in the early stages of exploratory data analysis and as an endpoint for knowled… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
13
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 32 publications
(14 citation statements)
references
References 84 publications
0
13
0
1
Order By: Relevance
“…The tightness measure is defined in [ 40 ], which quantifies the clustering stability. A comprehensive review of clustering stability is referred to [ 41 ]. If the number of Cartesian product clusters at the start of merging is large (for example, more than 100), we conduct a first-stage merging by bipartite clustering.…”
Section: Methodsmentioning
confidence: 99%
“…The tightness measure is defined in [ 40 ], which quantifies the clustering stability. A comprehensive review of clustering stability is referred to [ 41 ]. If the number of Cartesian product clusters at the start of merging is large (for example, more than 100), we conduct a first-stage merging by bipartite clustering.…”
Section: Methodsmentioning
confidence: 99%
“…As one of unsupervised learning approach, clustering is a fundamental concept in entailing the exploration and organization of data without the aid of labelled outcomes. In this approach, the algorithm endeavors to reveal concealed patterns and relationships within the dataset by grouping similar data points into distinct clusters (Liu et al, 2022). Furthermore, clustering is frequently used in any discipline that involves multivariate data analysis (Lipovetsky, 2022).…”
Section: Introductionmentioning
confidence: 99%
“…In addition to internal and external indices, some stability indices evaluate clustering methods based on the clustering stability when the original data are perturbed (resampled, split, subsampled). See [21] for a recent review of these indices. The key idea is to produce perturbed datasets whose distribution is close to the original and apply the clustering methods.…”
Section: Introductionmentioning
confidence: 99%