2019
DOI: 10.1609/aaai.v33i01.33015000
|View full text |Cite
|
Sign up to set email alerts
|

On the Persistence of Clustering Solutions and True Number of Clusters in a Dataset

Abstract: Typically clustering algorithms provide clustering solutions with prespecified number of clusters. The lack of a priori knowledge on the true number of underlying clusters in the dataset makes it important to have a metric to compare the clustering solutions with different number of clusters. This article quantifies a notion of persistence of clustering solutions that enables comparing solutions with different number of clusters. The persistence relates to the range of dataresolution scales over which a cluste… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 22 publications
0
3
0
Order By: Relevance
“…Performance indexes including SA, mean intersection-overunion (mIoU) and E-measure [30][31][32] are used to give numerical results of SRG, ISRG, JSEG, algorithm in [13], algorithm in [17] and the proposed algorithm. SA is defined as the ratio of a number of correctly classified pixels to the total number of all pixels.…”
Section: Experiments On Region Growing Based Algorithmsmentioning
confidence: 99%
“…Performance indexes including SA, mean intersection-overunion (mIoU) and E-measure [30][31][32] are used to give numerical results of SRG, ISRG, JSEG, algorithm in [13], algorithm in [17] and the proposed algorithm. SA is defined as the ratio of a number of correctly classified pixels to the total number of all pixels.…”
Section: Experiments On Region Growing Based Algorithmsmentioning
confidence: 99%
“…The number of edges needed to reach maximum correlation depends on the size and nature of the data. Intuitively, the association between D X and D U is related to the concept of persistence of clustering solutions [35] , i.e., if the shape is persistent along the edges, the computed correlation will be consistently similar. Fig.…”
Section: Optional: Visualize the Relationship Between Correlation And The Number Of Edgesmentioning
confidence: 99%
“…The number of edges needed to reach maximum correlation depends on the size and nature of the data. Intuitively, the association between D X and D U is related to the concept of persistence of clustering solutions [34] , i.e., if the shape is persistent along the edges, the computed correlation will be consistently similar. Fig.…”
Section: Optional: Visualize the Relationship Between Correlation And...mentioning
confidence: 99%