2009 50th Annual IEEE Symposium on Foundations of Computer Science 2009
DOI: 10.1109/focs.2009.14
|View full text |Cite
|
Sign up to set email alerts
|

k-Means Has Polynomial Smoothed Complexity

Abstract: The k-means method is one of the most widely used clustering algorithms, drawing its popularity from its speed in practice. Recently, however, it was shown to have exponential worst-case running time. In order to close the gap between practical performance and theoretical analysis, the k-means method has been studied in the model of smoothed analysis. But even the smoothed analyses so far are unsatisfactory as the bounds are still super-polynomial in the number n of data points.In this paper, we settle the smo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
64
0
1

Year Published

2010
2010
2019
2019

Publication Types

Select...
6
4

Relationship

2
8

Authors

Journals

citations
Cited by 78 publications
(65 citation statements)
references
References 28 publications
0
64
0
1
Order By: Relevance
“…Then, the discriminant analysis is applied by leave-one-out cross-validation with k-means clustering. 37 In statistics and data mining, the k-means method is one of the most widely used clustering algorithms with nearest mean, drawing its popularity from its speed in practice. 37,38 Leave-one-out crossvalidation involves single observation from the original sample as the validation data, and the remaining observations as the training data.…”
Section: Data Analysesmentioning
confidence: 99%
“…Then, the discriminant analysis is applied by leave-one-out cross-validation with k-means clustering. 37 In statistics and data mining, the k-means method is one of the most widely used clustering algorithms with nearest mean, drawing its popularity from its speed in practice. 37,38 Leave-one-out crossvalidation involves single observation from the original sample as the validation data, and the remaining observations as the training data.…”
Section: Data Analysesmentioning
confidence: 99%
“…Our work is also related to existing work on characterizing the stability of numerical methods [4,8] as well as the smoothed complexity of algorithms [1,20,21]. The key difference between our work and existing work in these areas is that we explore the smoothness of the value as well as the structure of our results.…”
Section: Related Workmentioning
confidence: 95%
“…Smoothed analysis has originally been invented to explain the practical performance of the simplex method [34]. Since then, smoothed analysis has been applied successfully to a variety of algorithms and problems [2,31,36]. We refer to Spielman and Teng for a survey [35].…”
Section: Smoothed Analysismentioning
confidence: 99%