2020
DOI: 10.48550/arxiv.2002.01822
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Comparing clusterings and numbers of clusters by aggregation of calibrated clustering validity indexes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…In all our examples, there is a "true" class label available, and we make use of these in evaluating the clustering methods we consider. However, in most cases in practice there are no true class labels, and even if there are such labels recovering them may not be the purpose of a cluster analysis (Akhanli and Hennig, 2020). As we have emphasized, a main advantage of our method is the ability to specify what aspects of the data define clusters through the choice of random effects used in defining mixed predictive replicates.…”
Section: Datasetsmentioning
confidence: 99%
“…In all our examples, there is a "true" class label available, and we make use of these in evaluating the clustering methods we consider. However, in most cases in practice there are no true class labels, and even if there are such labels recovering them may not be the purpose of a cluster analysis (Akhanli and Hennig, 2020). As we have emphasized, a main advantage of our method is the ability to specify what aspects of the data define clusters through the choice of random effects used in defining mixed predictive replicates.…”
Section: Datasetsmentioning
confidence: 99%
“…Previous work, including [9], could not conclude superiority of any single validity index. Thus, to obtain a comprehensive understanding of the difficulty of a given dataset, many cluster validity indices could potentially be used in combination [42]. In the context of a fitness function, this could be done in the form of an aggregation of indices or through formulation as a manyobjective problem.…”
Section: Computing the Fitness Of A Datasetmentioning
confidence: 99%