2019
DOI: 10.3390/j2020016
|View full text |Cite
|
Sign up to set email alerts
|

Research on K-Value Selection Method of K-Means Clustering Algorithm

Abstract: Among many clustering algorithms, the K-means clustering algorithm is widely used because of its simple algorithm and fast convergence. However, the K-value of clustering needs to be given in advance and the choice of K-value directly affect the convergence result. To solve this problem, we mainly analyze four K-value selection algorithms, namely Elbow Method, Gap Statistic, Silhouette Coefficient, and Canopy; give the pseudo code of the algorithm; and use the standard data set Iris for experimental verificati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
279
1
16

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 549 publications
(359 citation statements)
references
References 11 publications
1
279
1
16
Order By: Relevance
“…EMD has been reported to induce the production of anti-EMD antibodies in the host [11]; it is generally recognized that only peptides of greater than approximately 10 residues (or > 5 kDa) can function as antigens [12,13]. Thus, because SP is only seven amino acids in length and 1,118 Da in molecular mass, it exhibits very little risk of eliciting an immunological response.…”
Section: Introductionmentioning
confidence: 99%
“…EMD has been reported to induce the production of anti-EMD antibodies in the host [11]; it is generally recognized that only peptides of greater than approximately 10 residues (or > 5 kDa) can function as antigens [12,13]. Thus, because SP is only seven amino acids in length and 1,118 Da in molecular mass, it exhibits very little risk of eliciting an immunological response.…”
Section: Introductionmentioning
confidence: 99%
“…The Elbow method is a -value method that determines the best value by looking at the number of places that will form Elbouw at a point [8]. Performance indicators use number of squared errors (SSE).Clusters are said to be convergent when obtaining a smaller value compared to others [2]. SSE formula in K-Means [8]: (5) The Elbow algorithm method in K-Means as follows [9]: 1.…”
Section:  Silhouette Methodsmentioning
confidence: 99%
“…Input data that has been calculated by Euclidean distance will be one cluster if the distance between the input data with centroid has the smallest value compared to the distance between the input data and centroid on the other cluster. The new centroid of cluster is calculated using the formula: (2) where:…”
Section: A K-meansmentioning
confidence: 99%
See 2 more Smart Citations