2008
DOI: 10.3844/ajassp.2008.1247.1250
|View full text |Cite
|
Sign up to set email alerts
|

New Efficient Strategy to Accelerate k-Means Clustering Algorithm

Abstract: One of the most popular clustering techniques is the k-means clustering algorithm. However, the utilization of the k-means is severely limited by its high computational complexity. In this study, we propose a new strategy to accelerate the k-means clustering algorithm through the Partial Distance (PD) logic. The proposed strategy avoids many unnecessary distance calculations by applying efficient PD strategy. Experiments show the efficiency of the proposed strategy when applied to different data sets

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2010
2010
2020
2020

Publication Types

Select...
6
2
2

Relationship

0
10

Authors

Journals

citations
Cited by 23 publications
(18 citation statements)
references
References 16 publications
0
18
0
Order By: Relevance
“…Data clustering techniques are an important aspect used in many fields such as data mining [20], pattern recognition and pattern classification [3], data compression, machine learning [8], image analysis [26], and bioinformatics [22]. The purpose of clustering is to group data points into clusters in which the similar data points are grouped in the same cluster while dissimilar data points are in different clusters.…”
Section: Data Clusteringmentioning
confidence: 99%
“…Data clustering techniques are an important aspect used in many fields such as data mining [20], pattern recognition and pattern classification [3], data compression, machine learning [8], image analysis [26], and bioinformatics [22]. The purpose of clustering is to group data points into clusters in which the similar data points are grouped in the same cluster while dissimilar data points are in different clusters.…”
Section: Data Clusteringmentioning
confidence: 99%
“…While K-Means discovers hard clusters (a point belong to only one cluster), Fuzzy K-Means is a more statistically formalized method and discovers soft clusters where a particular point can belong to more than one cluster with certain probability. In this study, we propose a new strategy to accelerate the k-means clustering algorithm (Al-Zoubi et al, 2008) through the Partial Distance (PD) logic. The Fuzzy K-means accepts an input file containing vector points student and faculty data sets.…”
Section: Methodsmentioning
confidence: 99%
“…Belal et al (2005) proposed a new method for cluster initialization based on finding a set of medians extracted from a dimension with maximum variance. Zoubi et al (2008) proposed a new strategy to accelerate K-means clustering by avoiding unnecessary distance calculations through the partial distance logic. Fahim et al (2009) proposed a method to select a good initial solution by partitioning dataset into blocks and applying K-means to each block.…”
Section: Related Workmentioning
confidence: 99%