2019
DOI: 10.17559/tv-20190109015453
|View full text |Cite
|
Sign up to set email alerts
|

CUBOS: An Internal Cluster Validity Index for Categorical Data

Abstract: Internal cluster validity index is a powerful tool for evaluating clustering performance. The study on internal cluster validity indices for categorical data has been a challenging task due to the difficulty in measuring distance between categorical attribute values. While some efforts have been made, they ignore the relationship between different categorical attribute values and the detailed distribution information between data objects. To solve these problems, we propose a novel index called Categorical dat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(7 citation statements)
references
References 22 publications
0
7
0
Order By: Relevance
“…We first introduce the case dataset used. Then, NC-based method and CAMER are employed to measure the evaluation abilities of seven popular external CVIs [17] to internal CVIs respectively. Finally, we summarize the differences between the two methods, to clarify the superiority of CAMER.…”
Section: Comparison Of Camer and Nc-based Methodsmentioning
confidence: 99%
“…We first introduce the case dataset used. Then, NC-based method and CAMER are employed to measure the evaluation abilities of seven popular external CVIs [17] to internal CVIs respectively. Finally, we summarize the differences between the two methods, to clarify the superiority of CAMER.…”
Section: Comparison Of Camer and Nc-based Methodsmentioning
confidence: 99%
“…The publication trend shows an increase, with 2019 emerging as the most productive year. In that year, 11 articles related to clustering were published, covering hierarchical-based [47,48], rough-set-based [49,50], weight-based [51], graph-based [52], a variant of fuzzy clustering [53][54][55], integer linear programming [56], and clustering validity [57]. Moreover, for a deeper comprehension of the citations and publications spanning the period from 2014 to 2023, Table 3 presents the top ten cited articles.…”
Section: • Publication Yearsmentioning
confidence: 99%
“…Table 15 shows the summary of cluster validity. Ng's K-modes [143,220], K-modes [15,221], WKM [176] Bai & Liang (2015) generalized validity function [65] K-modes, CU [222], IE [102] Gao & Wu (2019) IDC, CUBOS [57] CCI [223], CDCS [224], IE, CU, NCC [225] All the validity functions in this study focus on internal validity functions. In 2014, Bai and Liang [39] improved the K-modes algorithm by optimizing its objective function to incorporate both between-cluster separation and within-cluster compactness.…”
Section: Validity Functionmentioning
confidence: 99%
See 2 more Smart Citations