2017
DOI: 10.14778/3067421.3067425

Local search methods for k-means with outliers

Abstract: We study the problem of k-means clustering in the presence of outliers. The goal is to cluster a set of data points to minimize the variance of the points assigned to the same cluster, with the freedom of ignoring a small set of data points that can be labeled as outliers. Clustering with outliers has received a lot of attention in the data processing community, but practical, efficient, and provably good algorithms remain unknown for the most popular k-means objective. Our work proposes a simple local search-…
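
The objective described in the abstract can be made concrete with a small sketch. The Python snippet below is only an illustration of the cost function, not the paper's local search algorithm: it assumes the usual formulation in which each point pays its squared distance to the nearest center and the z most expensive points are discarded as outliers. The function name `kmeans_outlier_cost` and the parameter `z` are hypothetical names chosen for this example.

```python
import numpy as np


def kmeans_outlier_cost(points, centers, z):
    """k-means cost of `centers` on `points`, ignoring the z farthest points.

    Assumption: outliers are the z points with the largest squared
    distance to their nearest center.
    """
    points = np.asarray(points, dtype=float)
    centers = np.asarray(centers, dtype=float)
    # Squared distance from every point to every center, shape (n, k).
    d2 = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    # Each point is charged the squared distance to its nearest center.
    nearest = d2.min(axis=1)
    if z > 0:
        # Drop the z largest charges; those points are treated as outliers.
        nearest = np.sort(nearest)[:-z]
    return float(nearest.sum())


if __name__ == "__main__":
    pts = [[0, 0], [1, 0], [10, 0], [11, 0], [100, 0]]
    ctrs = [[0.5, 0], [10.5, 0]]
    print(kmeans_outlier_cost(pts, ctrs, z=0))  # far point dominates the cost
    print(kmeans_outlier_cost(pts, ctrs, z=1))  # far point ignored as an outlier
```

In this toy input, a single distant point accounts for almost all of the cost when z = 0, which is why allowing a small number of ignored points changes which centers look good.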

Cited by 87 publications (56 citation statements)
References 29 publications

“…The sample time and the speed values of the road sections are adopted to perform the experiments. For comparison, K-means [14] and Fuzzy C-means methods [15] are also employed to perform the experiments. As the peak detection methods [25,26] are commonly used to detect the traffic peak periods.…”
Section: Methods (mentioning)
confidence: 99%
“…K-means [14] and Fuzzy C-means methods [15] are the most common methods used in TPPD. Clustering methods have achieved good performances.…”
Section: Introduction (mentioning)
confidence: 99%
“…ANN is used frequently in STLF with good/satisfactory performance. It can model an unspecified nonlinear relationship between load and weather variables [35].…”
Section: Artificial Neural Network (mentioning)
confidence: 99%
“…One of the well‐known clustering algorithms that is widely studied is K‐means. It is a simple method to partition data into specified number of clusters (k).…”
Section: Introduction (mentioning)
confidence: 99%
“…There have been increasing efforts in recent years to study clustering problems from both a theoretical and practical point of view [6,8,11,12].…”
Section: Introduction (mentioning)
confidence: 99%