2013
DOI: 10.1002/sam.11176
|View full text |Cite
|
Sign up to set email alerts
|

A Bayesian criterion for cluster stability

Abstract: We present a technique for evaluating and comparing how clusterings reveal structure inherent in the data set. Our technique is based on a criterion evaluating how much point-to-cluster distances may be perturbed without affecting the membership of the points. Although similar to some existing perturbation methods, our approach distinguishes itself in five ways. First, the strength of the perturbations is indexed by a prior distribution controlling how close to boundary regions a point may be before it is cons… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 24 publications
0
6
0
Order By: Relevance
“…There are other new methods proposed which can be found, for example, in the works of: Granichin et al (2015), Hosein et al (2011), Koepke, Clarke (2013) and Ryazanov (2016).…”
Section: Discussionmentioning
confidence: 99%
“…There are other new methods proposed which can be found, for example, in the works of: Granichin et al (2015), Hosein et al (2011), Koepke, Clarke (2013) and Ryazanov (2016).…”
Section: Discussionmentioning
confidence: 99%
“…Notice that many penalties could be used to determine the number of clusters and the gap statistic was just one of those. Examples can be found in Koepke and Clarke (2013) . To know whether the performance of our competitors could be significantly improved if other penalties were adopted, we compared our BIC selector based on an unknown with our competitors based on a known .…”
Section: Simulationmentioning
confidence: 99%
“…This method is called resampling, a thorough review of which is provided by Leisch . Koepke and Clarke provide a broad introduction to different categories of methods for measuring stability. We hereby call an overall clustering result a partition .…”
Section: Introductionmentioning
confidence: 99%
“…Luxburg provides a nice overview. Koepke and Clarke proposed a Bayesian criterion for evaluating cluster stability. Their idea is distinct because the perturbation of clustering is not performed by resampling but rather by introducing a nonnegative scaling parameter for each cluster.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation