2017
DOI: 10.3390/sym9040058
|View full text |Cite
|
Sign up to set email alerts
|

A Fast K-prototypes Algorithm Using Partial Distance Computation

Abstract: Abstract:The k-means is one of the most popular and widely used clustering algorithm; however, it is limited to numerical data only. The k-prototypes algorithm is an algorithm famous for dealing with both numerical and categorical data. However, there have been no studies to accelerate it. In this paper, we propose a new, fast k-prototypes algorithm that provides the same answers as those of the original k-prototypes algorithm. The proposed algorithm avoids distance computations using partial distance computat… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
3
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 9 publications
(7 citation statements)
references
References 7 publications
0
6
0
1
Order By: Relevance
“…For performance evaluation, we compare a basic k-prototypes algorithm (Basic), partial distance computation pruning (PDC) [22] and our two algorithms (KCP and KBP). We examined the performance of our algorithms as the number of objects, the number of clusters, the number of categorical attributes and the number of divisions in each dimension increased.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…For performance evaluation, we compare a basic k-prototypes algorithm (Basic), partial distance computation pruning (PDC) [22] and our two algorithms (KCP and KBP). We examined the performance of our algorithms as the number of objects, the number of clusters, the number of categorical attributes and the number of divisions in each dimension increased.…”
Section: Methodsmentioning
confidence: 99%
“…In order to improve the performance of the k-prototypes algorithm, Kim [22] proposed the concept of partial distance computation (PDC) which compares only partial attributes, not all attributes in measuring distance. The maximum distance that can be measured in one categorical attribute is 1.…”
Section: Existing Pruning Technique In the K-prototypes Algorithmmentioning
confidence: 99%
See 1 more Smart Citation
“…Wangchamhan, Chiewchanwattana and Sunat (2017) endereçaram o trabalho para resolver o problema de inicialiação dos centróides no k-means, considerando a dissimilaridade de Gower para lidar com dados mistos. Kim (2017) propuseram um método para acelerar a execução do algoritmo k-prototype, pela redução do cálculo das distâncias. Özbakır and Turna (2017) modelaram duas meta-heurísticas para endereçar o problema de definição dos grupos.…”
Section: Resultsunclassified
“…However, only a few studies have been conducted to reduce the time complexity of the k-prototypes algorithm. Kim [16] proposed a pruning technique to reduce distance computation between an object and cluster centers using the concept of partial distance computation.…”
Section: Introductionmentioning
confidence: 99%