2004
DOI: 10.1007/978-3-540-30501-9_54
|View full text |Cite
|
Sign up to set email alerts
|

Parallel K-Means Clustering Algorithm on DNA Dataset

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0
1

Year Published

2012
2012
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 3 publications
0
3
0
1
Order By: Relevance
“…A parallel implementation of the k-means clustering algorithm on a cluster of personal computers (PCs) was described in [27]. The proposed algorithm is parallelised based on the inherent data-parallelism especially in the distance calculation and centroid update operations for DNA dataset.…”
Section: Related Workmentioning
confidence: 99%
“…A parallel implementation of the k-means clustering algorithm on a cluster of personal computers (PCs) was described in [27]. The proposed algorithm is parallelised based on the inherent data-parallelism especially in the distance calculation and centroid update operations for DNA dataset.…”
Section: Related Workmentioning
confidence: 99%
“…Many researchers tried to parallelize the k-means algorithm using either parallel or distributed systems. In [5], the authors used the MapReduce framework and the Hadoop Distributed File System (HDFS) in order to distribute the computation workload among the nodes of the system. The proposed techniques applied the k-means locally on the nodes of the system and the results produced were used by the master node in order to produce global centroids applying again the k-means on them.…”
Section: Review Of Litera Turementioning
confidence: 99%
“…Although the complexity of k-means is not high, O(I knm), for I iterations, K clusters, n instances, and m features, this implementation may require a long time if the number of iterations for convergence is large. Othman et al [101] developed a similar solution for clustering DNA data. There are many other parallel versions of k-means based on the principle of data distribution [73].…”
Section: Parallelizationmentioning
confidence: 99%