2019
DOI: 10.1109/access.2019.2907885
|View full text |Cite
|
Sign up to set email alerts
|

A Hybrid MPI/OpenMP Parallelization of $K$ -Means Algorithms Accelerated Using the Triangle Inequality

Abstract: The standard formulation of the K -means clustering (Lloyd's method) performs many unnecessary distance calculations. In this paper, we focus on four approaches that use the triangle inequality to avoid unnecessary distance calculations. These approaches are Drake's, Elkan's, Annulus, and Yinyang algorithms. We propose a hybrid MPI/OpenMP parallelization of these algorithms in which the dataset and the corresponding data structures storing bounds on distances are evenly divided among MPI processes. Then, in th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 32 publications
(19 citation statements)
references
References 45 publications
0
19
0
Order By: Relevance
“…We used a version of these last two algorithms parallelized using OpenMP threads. The code implementing the Annulus and Exponion algorithms is a simplification of a hybrid MPI/OpenMP code [44] that does not execute any MPI calls. The last variant is an implementation of Lloyd's method for NVIDIA graphics GPUs from an open source KMCUDA package 1 .…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…We used a version of these last two algorithms parallelized using OpenMP threads. The code implementing the Annulus and Exponion algorithms is a simplification of a hybrid MPI/OpenMP code [44] that does not execute any MPI calls. The last variant is an implementation of Lloyd's method for NVIDIA graphics GPUs from an open source KMCUDA package 1 .…”
Section: Resultsmentioning
confidence: 99%
“…Regarding accelerated algorithms, shared-memory parallelization of the triangle inequality-based methods has been described by Hamerly and Drake [28], Ding et al [32], and Newling and Fleuret [31]. Kwedlo and Czochanski [44] proposed a hybrid MPI/OpenMP parallelization of some triangle inequality-based methods for clusters of multiprocessors. As for kd-tree-based methods, a version for distributed systems, parallelized using MPI bindings for Java, was described by Pettinger and Di Fatta [45].…”
Section: Related Researchmentioning
confidence: 99%
See 1 more Smart Citation
“…[29] Computational complexity Multiple kernel k-means with late fusion. [30] Image processing A hybrid parallelization of k-means algorithm.…”
Section: Referencementioning
confidence: 99%
“…W. Kwedlo et al [19] proposed four approaches namely Drake's, Elkan's, Annulus, and Yinyang algorithms to minimize the unnecessary distance calculation in K-means clustering algorithm. In this paper a hybrid MPI/ OpenMP programming models are used to parallelize these algorithms.…”
Section: Related Workmentioning
confidence: 99%