2022
DOI: 10.3390/app12157515
|View full text |Cite
|
Sign up to set email alerts
|

AutoElbow: An Automatic Elbow Detection Method for Estimating the Number of Clusters in a Dataset

Abstract: The elbow technique is a well-known method for estimating the number of clusters required as a starting parameter in the K-means algorithm and certain other unsupervised machine-learning algorithms. However, due to the graphical output nature of the method, human assessment is necessary to determine the location of the elbow and, consequently, the number of data clusters. This article presents a simple method for estimating the elbow point, thus, enabling the K-means algorithm to be readily automated. First, t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
10
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 27 publications
(10 citation statements)
references
References 17 publications
0
10
0
Order By: Relevance
“…To provide spatially variable coastal dynamics characteristics, we used the K-means algorithm 37 to cluster the HTF threshold’s input data into different categories. The elbow approach 38 is used to determine the optimal number of clusters. This approach plots the number of clusters against the total distance between data points and their respective cluster centroids and identifies the elbow point on the plot, which indicates a trade-off between minimizing the distance and mitigating the complexity associated with an excessive number of clusters.…”
Section: Resultsmentioning
confidence: 99%
“…To provide spatially variable coastal dynamics characteristics, we used the K-means algorithm 37 to cluster the HTF threshold’s input data into different categories. The elbow approach 38 is used to determine the optimal number of clusters. This approach plots the number of clusters against the total distance between data points and their respective cluster centroids and identifies the elbow point on the plot, which indicates a trade-off between minimizing the distance and mitigating the complexity associated with an excessive number of clusters.…”
Section: Resultsmentioning
confidence: 99%
“…L-method places two straight lines on the elbow curve. One line extends from the head of the curve to the candidate point on the curve, while the other line extends from the tail of the curve to the candidate point on the same curve [77]. Using this technique, we first compute the Bayesian Information Criterion (BIC) values, as well as the first-order differences.…”
Section: L-methods Techniquementioning
confidence: 99%
“…However, with the L-method, long tails within the curve can influence the value of the optimal number of clusters, k [77]. To minimize this effect, a proposed iterative method aims to gradually decrease the tail while simultaneously refining the point of the elbow.…”
Section: L-methods Techniquementioning
confidence: 99%
See 1 more Smart Citation
“…By observing the WCSS curve, we looked for a point where the rate of decrease in WCSS significantly slowed down, creating an elbow in the plot. The K value at this elbow point is considered the optimal number of clusters, as it indicates a trade-off between maximizing the number of clusters and minimizing WCSS [53].…”
mentioning
confidence: 99%