2020
DOI: 10.3233/fi-2020-1909
|View full text |Cite
|
Sign up to set email alerts
|

On the Consistency of k-means++ algorithm

Abstract: We prove in this paper that the expected value of the objective function of the k-means++ algorithm for samples converges to population expected value. As k-means++, for samples, provides with constant factor approximation for k-means objectives, such an approximation can be achieved for the population with increase of the sample size.This result is of potential practical relevance when one is considering using subsampling when clustering large data sets (large data bases).

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 10 publications
0
1
0
Order By: Relevance
“…The k-means like algorithms are of interest not only because their efficiency, but also due to interesting theoretical property like consistency in the limit for growing sample sizes [12], applicability of kernel-trick for Euclidean [11] and non-Euclidean space [16], existence of cluster-preserving transformations [13], possibility to check clusterability [14], applicability in label-free test set evaluations [22], privacy preserving [26] and many other [10], [17], [15]. Therefore, intense studies of k-means family are vital.…”
Section: Previous Workmentioning
confidence: 99%
“…The k-means like algorithms are of interest not only because their efficiency, but also due to interesting theoretical property like consistency in the limit for growing sample sizes [12], applicability of kernel-trick for Euclidean [11] and non-Euclidean space [16], existence of cluster-preserving transformations [13], possibility to check clusterability [14], applicability in label-free test set evaluations [22], privacy preserving [26] and many other [10], [17], [15]. Therefore, intense studies of k-means family are vital.…”
Section: Previous Workmentioning
confidence: 99%