2017
DOI: 10.21917/ijsc.2017.0218
|View full text |Cite
|
Sign up to set email alerts
|

CLUSTERING CATEGORICAL DATA USING k-MODES BASED ON CUCKOO SEARCH OPTIMIZATION ALGORITHM

Abstract: Cluster analysis is the unsupervised learning technique that finds the interesting patterns in the data objects without knowing class labels. Most of the real world dataset consists of categorical data. For example, social media analysis may have the categorical data like the gender as male or female. The k-modes clustering algorithm is the most widely used to group the categorical data, because it is easy to implement and efficient to handle the large amount of data. However, due to its random selection of in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 20 publications
0
4
0
1
Order By: Relevance
“…Distance is either entropy [41] or Hamming distance [43]. Other algorithms have been proposed to improve optimization of the k-modes objective function without repeated reinitialization, including a tabu search [50], a genetic algorithm [51], a partical swarm optimizer [52], an artificial bee colony optimizer [39] and a Cuckoo search [53]. Most global optimizers and some initializers employ the traditional k-modes algorithms within their approach, and hence could benefit from use of the OTQT or OT algorithms.…”
Section: B Resultsmentioning
confidence: 99%
“…Distance is either entropy [41] or Hamming distance [43]. Other algorithms have been proposed to improve optimization of the k-modes objective function without repeated reinitialization, including a tabu search [50], a genetic algorithm [51], a partical swarm optimizer [52], an artificial bee colony optimizer [39] and a Cuckoo search [53]. Most global optimizers and some initializers employ the traditional k-modes algorithms within their approach, and hence could benefit from use of the OTQT or OT algorithms.…”
Section: B Resultsmentioning
confidence: 99%
“…K-Modes merupakan algoritma clustering yang sering digunakan untuk mengelompokan data kategorik karena mudah diimplementasikan dan efisien untuk menangani data dalam jumlah besar [15]. Algoritma k-Modes menjamin konvergensi, artinya algoritma tersebut dapat memberikan hasil yang pasti.…”
Section: Pendahuluanunclassified
“…Accuracy: Accuracy refers to a measure of how well a model's predictions match the modeled reality. The classifi cation accuracy of the test data can be calculated by dividing the number of correctly classifi ed objects by the total number of objects (Lakshmi, K., Visalakshi, N. K., Shanthi, S., & Parvathavarthini, S., 2017;Tatbul, N., Lee, T. J., Zdonik, S., Alam, M., & Gottschlich, J., 2018).…”
Section: Fig 5: An Example Of a Pattern Generated By A Genetic Algori...mentioning
confidence: 99%