2006
DOI: 10.1007/s10994-005-5316-9
|View full text |Cite
|
Sign up to set email alerts
|

A Unified View on Clustering Binary Data

Abstract: Abstract.Clustering is the problem of identifying the distribution of patterns and intrinsic correlations in large data sets by partitioning the data points into similarity classes. This paper studies the problem of clustering binary data. Binary data have been occupying a special place in the domain of data analysis. A unified view of binary data clustering is presented by examining the connections among various clustering criteria. Experimental studies are conducted to empirically verify the relationships.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0

Year Published

2007
2007
2024
2024

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 37 publications
(24 citation statements)
references
References 33 publications
(28 reference statements)
0
24
0
Order By: Relevance
“…Given two binary data points z 1 and z 2 , there are four fundamental quantities that can be used to define similarity between the two [35]:…”
Section: Outlier Handlingmentioning
confidence: 99%
“…Given two binary data points z 1 and z 2 , there are four fundamental quantities that can be used to define similarity between the two [35]:…”
Section: Outlier Handlingmentioning
confidence: 99%
“…A unified view of binary data clustering has been provided by examining the connections among various methods including entropy-based methods, distance-based methods (e.g., K-means), mixture models, and matrix decomposition [38,39]. In addition, it also shows the equivalence between K-means clustering methods with many other methods on binary data clustering using empirical studies [38,39]. In our experiments, we use K-means as the clustering methods.…”
Section: Methodsmentioning
confidence: 99%
“…For the solution of clustering problem the traditional algorithms, such as k-means algorithm [19,20], hierarchical clustering, differential evaluation algorithm, particle swarm optimization algorithm, artificial bee colony optimization, ant colony algorithm, and neural network algorithm GEM (Gaussian expectation-maximization), are usually used [21][22][23][24][25][26]. The up-to-date survey of evolutionary algorithms for clustering, especially the partition algorithms, are described in detail in [27].…”
Section: Related Workmentioning
confidence: 99%