2023
DOI: 10.3991/ijoe.v19i04.37059
|View full text |Cite
|
Sign up to set email alerts
|

Determining the Optimal Number of Clusters using Silhouette Score as a Data Mining Technique

Abstract: The identification of the same objects is very important in determining the similarity between different objects. Nowadays, there are several techniques that allow us to divide objects into different groups that differ from one to another. In order to have the best separation between the clusters, it is required that the optimal determination of the number of clusters of a corpus be made in advance. In our research, the Silhouette score technique was used in order to make the optimal determination of this numb… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 6 publications
0
1
0
Order By: Relevance
“…Clustering is done into two clusters (expecting that they will correspond to investigated Tiers), for all participants and for each scenario, separately for guided and self-produced parts. The quality of clusterization into two clusters has been measured by the Silhouette score 56 . That score takes values from -1 to 1, and positive values, preferably higher than 0.5, prove correct clusterization.…”
Section: Resultsmentioning
confidence: 99%
“…Clustering is done into two clusters (expecting that they will correspond to investigated Tiers), for all participants and for each scenario, separately for guided and self-produced parts. The quality of clusterization into two clusters has been measured by the Silhouette score 56 . That score takes values from -1 to 1, and positive values, preferably higher than 0.5, prove correct clusterization.…”
Section: Resultsmentioning
confidence: 99%