2022
DOI: 10.48550/arxiv.2208.01261
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Are Cluster Validity Measures (In)valid?

Marek Gagolewski,
Maciej Bartoszuk,
Anna Cena

Abstract: Internal cluster validity measures (such as the Caliński-Harabasz, Dunn, or Davies-Bouldin indices) are frequently used for selecting the appropriate number of partitions a dataset should be split into. In this paper we consider what happens if we treat such indices as objective functions in unsupervised learning activities. Is the optimal grouping with regards to, say, the Silhouette index really meaningful? It turns out that many cluster (in)validity indices promote clusterings that match expert knowledge qu… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 47 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?