Proceedings of the Twenty-Second Annual ACM-SIAM Symposium on Discrete Algorithms 2011
DOI: 10.1137/1.9781611973082.3
|View full text |Cite
|
Sign up to set email alerts
|

Streaming k-means on Well-Clusterable Data

Abstract: One of the central problems in data-analysis is k-means clustering. In recent years, considerable attention in the literature addressed the streaming variant of this problem, culminating in a series of results (Har-Peled and Mazumdar; Frahling and Sohler; Frahling, Monemizadeh, and Sohler; Chen) that produced a (1 + ε)-approximation for k-means clustering in the streaming setting. Unfortunately, since optimizing the k-means objective is Max-SNP hard, all algorithms that achieve a (1 + ε)-approximation must tak… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
34
0
1

Year Published

2013
2013
2019
2019

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 39 publications
(35 citation statements)
references
References 20 publications
0
34
0
1
Order By: Relevance
“…It is implicit in [23, Secs. 3.1-3.2] that metric k-median has a determinis- Many other algorithms are known for weighted or unweighted metric k-median selection [5,12,14,27,34], its online, streaming and non-metric variants [1,8,13,31,33] as well as the related problems of metric facility location [6,11,15,32,38], metric k-means selection [2,4,28] and allpairs shortest paths [7,9,[16][17][18]37]. All these algorithms, unlike ours, either randomize or inspect the entire available data.…”
Section: Introductionmentioning
confidence: 99%
“…It is implicit in [23, Secs. 3.1-3.2] that metric k-median has a determinis- Many other algorithms are known for weighted or unweighted metric k-median selection [5,12,14,27,34], its online, streaming and non-metric variants [1,8,13,31,33] as well as the related problems of metric facility location [6,11,15,32,38], metric k-means selection [2,4,28] and allpairs shortest paths [7,9,[16][17][18]37]. All these algorithms, unlike ours, either randomize or inspect the entire available data.…”
Section: Introductionmentioning
confidence: 99%
“…Later, an algorithm storing only O(k log 2 n) points was provided [10]. The current state-of-the-art O(1)-approximation stores O(k log n) points [6]. A variety of other results are known for Euclidean space.…”
Section: Regimementioning
confidence: 99%
“…The probability of K-assembles was showed up as before timetable as 1956 by Steinhaus [15]. A basic neighborhood heuristic for issue was proposed in 1957 by Lloyds [16].…”
Section: Iirealted Workmentioning
confidence: 99%