2020
DOI: 10.17977/um018v3i12020p40-49
|View full text |Cite
|
Sign up to set email alerts
|

Parallelization of Partitioning Around Medoids (PAM) in K-Medoids Clustering on GPU

Abstract: K-medoids clustering is categorized as partitional clustering. K-medoids offers better result when dealing with outliers and arbitrary distance metric also in the situation when the mean or median does not exist within data. However, k-medoids suffers a high computational complexity. Partitioning Around Medoids (PAM) has been developed to improve k-medoids clustering, consists of build and swap steps and uses the entire dataset to find the best potential medoids. Thus, PAM produces better medoids than other al… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 28 publications
0
3
0
Order By: Relevance
“…The clustering algorithm of choice was PAM (partitioning around medoids) because of its robustness, low sensitivity to outliers and its ability to preselect the number of clusters [ 40 , 41 ]. The elbow method was applied to give insight into the optimal number of cluster groups [ 42 ], while the silhouette method was used to assess the quality of the resulting partitions produced by PAM [ 41 – 43 ]. To guide the interpretation of the cluster analysis, summary statistics of the epi-features stratified by cluster were calculated, and maps of the classification were produced.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The clustering algorithm of choice was PAM (partitioning around medoids) because of its robustness, low sensitivity to outliers and its ability to preselect the number of clusters [ 40 , 41 ]. The elbow method was applied to give insight into the optimal number of cluster groups [ 42 ], while the silhouette method was used to assess the quality of the resulting partitions produced by PAM [ 41 – 43 ]. To guide the interpretation of the cluster analysis, summary statistics of the epi-features stratified by cluster were calculated, and maps of the classification were produced.…”
Section: Methodsmentioning
confidence: 99%
“…Epifeatures were standardized, with the aim of normalizing the data to establish a common scale, so that there is no distortion of values when there are larger ranges. The clustering algorithm of choice was PAM (partitioning around medoids) because of its robustness, low sensitivity to outliers and its ability to preselect the number of clusters [40,41]. The elbow method was applied to give insight into the optimal number of cluster groups [42], while the silhouette method was used to assess the quality of the resulting partitions produced by PAM [41][42][43].…”
Section: Plos Neglected Tropical Diseases (S1 Fig)mentioning
confidence: 99%
“…It also has a high cluster accuracy compared to other distance-based partitioning algorithms for mixed variable data [22]. The PAM algorithm is used to reduce computational time at the swap step [25]. And also PAM is parallelization of on GPU.…”
Section: Research Backgroundmentioning
confidence: 99%