2012
DOI: 10.1016/j.knosys.2012.01.006
|View full text |Cite
|
Sign up to set email alerts
|

A fuzzy k-prototype clustering algorithm for mixed numeric and categorical data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
59
0
1

Year Published

2015
2015
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 141 publications
(60 citation statements)
references
References 25 publications
0
59
0
1
Order By: Relevance
“…Moreover, we also carried out experiments using the traditional K-Means algorithm on the data sets containing solely numerical features. We did not include in our simulations other popular 255 clustering algorithms capable of dealing with data sets containing categorical data, such as k-prototype, SBAC and KL-FCM-GM [19, 10,17], because WK-DC was already shown to outperform all of them [18]. Table 2 shows the results of our experiments in terms of the adjusted Rand index (ARI) and the number of completed iterations necessary for convergence (Itr).…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…Moreover, we also carried out experiments using the traditional K-Means algorithm on the data sets containing solely numerical features. We did not include in our simulations other popular 255 clustering algorithms capable of dealing with data sets containing categorical data, such as k-prototype, SBAC and KL-FCM-GM [19, 10,17], because WK-DC was already shown to outperform all of them [18]. Table 2 shows the results of our experiments in terms of the adjusted Rand index (ARI) and the number of completed iterations necessary for convergence (Itr).…”
Section: Resultsmentioning
confidence: 99%
“…Such a diversification of clustering algorithms can be explained by a variety of different ways in which data groups, nor-10 mally referred to as clusters, may be formed. Some algorithms allow a given entity to belong to two or more clusters, sometimes even with different degrees of membership, but most of them allow it to belong to a single cluster only [9,10]. In this paper, we are particularly interested in the latter approach, including data partitioning algorithms with a crisp membership.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…Nonetheless, using hard clustering for GDA often leads to the issues of ecological fallacy, which can be shortly understood that statistics accurately describing group characteristics do not necessarily apply to individuals within that group. For this fact, Fuzzy C-Means (FCM) and its variants were considered as the appropriate methods to determine the distribution of a demographic feature on a map as described in some articles such as [1,5,10,[12][13][14][15][16][17][18][19]. Since the results of FCM are independent to the geographical factors, some improvements of that algorithm were made by attaching FCM with a spatial model such as SIM in [3] and SIM-PF in [7,16,18].…”
Section: Introductionmentioning
confidence: 99%