2016
DOI: 10.1371/journal.pone.0162259
|View full text |Cite
|
Sign up to set email alerts
|

What to Do When K-Means Clustering Fails: A Simple yet Principled Alternative Algorithm

Abstract: The K-means algorithm is one of the most popular clustering algorithms in current use as it is relatively fast yet simple to understand and deploy in practice. Nevertheless, its use entails certain restrictive assumptions about the data, the negative consequences of which are not always immediately apparent, as we demonstrate. While more flexible algorithms have been developed, their widespread use has been hindered by their computational and technical complexity. Motivated by these considerations, we present … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
94
0
5

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 149 publications
(100 citation statements)
references
References 43 publications
1
94
0
5
Order By: Relevance
“…The features of demand are the day types (weekday or weekend) and number of residents, while those for PV include the PV capacity, panel orientation and weather information. Clustering is completed using either k-means clustering [15] or maximum a-posteriori Dirichlet process mixtures (MAP-DP) clustering [16], which is useful for instances in which the number of clusters cannot be easily determined.…”
Section: A Feature-based Cluster Assignmentmentioning
confidence: 99%
“…The features of demand are the day types (weekday or weekend) and number of residents, while those for PV include the PV capacity, panel orientation and weather information. Clustering is completed using either k-means clustering [15] or maximum a-posteriori Dirichlet process mixtures (MAP-DP) clustering [16], which is useful for instances in which the number of clusters cannot be easily determined.…”
Section: A Feature-based Cluster Assignmentmentioning
confidence: 99%
“…Joaquín Pérez-Ortega 1 *, Nelva Nely Almanza-Ortega 1 , Andrea Vega-Villalobos 1 , Rodolfo Pazos-Rangel 2 , Crispín Zavala-Díaz 3…”
Section: Author Detailsunclassified
“…The accelerated progress of technology in recent time is fostering an important increase in the amount of generated and stored data [1][2][3][4] in fields such as engineering, finance, education, medicine, and commerce, among others. Therefore, there is justified interest in obtaining useful knowledge that can be extracted from those huge amounts of data, in order to help making better decisions and understanding the nature of data.…”
Section: Introductionmentioning
confidence: 99%
“…Современные исследователи [36,37] отмечают, что кластеризация данных -сложная задача, предполагающая выбор между различными методами, параметрами и показателями эффективности. Следовательно, особое внимание необходимо уделять выбору алгоритма кластеризации.…”
Section: методы исследованияunclassified