2009 World Congress on Nature &Amp; Biologically Inspired Computing (NaBIC) 2009
DOI: 10.1109/nabic.2009.5393623
|View full text |Cite
|
Sign up to set email alerts
|

Clustering categorical data using a swarm-based method

Abstract: Abstract-The K-Modes algorithm is one of the most popular clustering algorithms in dealing with categorical data. But the random selection of starting centers in this algorithm may lead to different clustering results and falling into local optima. In this paper we proposed a swarm-based K-Modes algorithm. The experimental results over two well known Soybean and Congressional voting categorical data sets show that our method can find the optimal global solutions and can make up the K-Modes shortcoming.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2010
2010
2022
2022

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 7 publications
0
5
0
Order By: Relevance
“…Distance is either entropy [41] or Hamming distance [43]. Other algorithms have been proposed to improve optimization of the k-modes objective function without repeated reinitialization, including a tabu search [50], a genetic algorithm [51], a partical swarm optimizer [52], an artificial bee colony optimizer [39] and a Cuckoo search [53]. Most global optimizers and some initializers employ the traditional k-modes algorithms within their approach, and hence could benefit from use of the OTQT or OT algorithms.…”
Section: B Resultsmentioning
confidence: 99%
“…Distance is either entropy [41] or Hamming distance [43]. Other algorithms have been proposed to improve optimization of the k-modes objective function without repeated reinitialization, including a tabu search [50], a genetic algorithm [51], a partical swarm optimizer [52], an artificial bee colony optimizer [39] and a Cuckoo search [53]. Most global optimizers and some initializers employ the traditional k-modes algorithms within their approach, and hence could benefit from use of the OTQT or OT algorithms.…”
Section: B Resultsmentioning
confidence: 99%
“…, z ld } for 1 ≤ l ≤ k where k is the number of clusters. Assume X and Y are two categorical data objects in D, the simple matching distance measure between X and Y is defined as [31] :…”
Section: K-modes Algorithm(k-ma)mentioning
confidence: 99%
“…This algorithm identifies the modes using a frequency-based technique by solving the following mathematical problem [31].…”
Section: K-modes Algorithm(k-ma)mentioning
confidence: 99%
“…Particle Swarm Optimization (PSO) is combined with kmodes clustering algorithm is proposed in [12]. PSO is one of the popular metaheuristic and swarm intelligence based optimization algorithms.…”
Section: Related Workmentioning
confidence: 99%