2009
DOI: 10.1007/s10489-009-0160-4
|View full text |Cite
|
Sign up to set email alerts
|

On combining multiple clusterings: an overview and a new perspective

Abstract: Many problems can be reduced to the problem of combining multiple clusterings. In this paper, we first summarize different application scenarios of combining multiple clusterings and provide a new perspective of viewing the problem as a categorical clustering problem. We then show the connections between various consensus and clustering criteria and discuss the complexity results of the problem. Finally we propose a new method to determine the final clustering. Experiments on kinship terms and clustering popul… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2012
2012
2020
2020

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 29 publications
(20 citation statements)
references
References 54 publications
0
20
0
Order By: Relevance
“…The combination of base clusterings can be constructed by three kinds of method: the consensus functions [Strehl and Ghosh 2002], the categorical clusterings [Gionis et al 2007], and the direct optimizations [Christou 2011]. The consensus functions focus on the total agreement of all the base clusterings from different perspectives [Li et al 2010]. The clustering ensemble can also be converted to the problem of clustering categorical data (categorical clustering [Guha et al 2000;Andritsos et al 2004], for short) by viewing each attribute as a way of producing a base clustering of the data.…”
Section: Process Of Clustering Ensemblementioning
confidence: 99%
See 1 more Smart Citation
“…The combination of base clusterings can be constructed by three kinds of method: the consensus functions [Strehl and Ghosh 2002], the categorical clusterings [Gionis et al 2007], and the direct optimizations [Christou 2011]. The consensus functions focus on the total agreement of all the base clusterings from different perspectives [Li et al 2010]. The clustering ensemble can also be converted to the problem of clustering categorical data (categorical clustering [Guha et al 2000;Andritsos et al 2004], for short) by viewing each attribute as a way of producing a base clustering of the data.…”
Section: Process Of Clustering Ensemblementioning
confidence: 99%
“…We can construct consensus functions by the following approaches: direct best matching [Li et al 2010], graph-based mappings [Strehl and Ghosh 2002;Fern and Brodley 2004], statistical mixture models [Topchy et al 2005], pairwise comparisons [Gionis et al 2007;Li et al 2010] and a number of other models. They are all built on the co-associations or pairwise agreements between clusterings (e.g., partition difference PD [Li et al 2010] that focuses on the similarity between partitions and QMI [Topchy et al 2005] that works on the consensus function based on quadratic mutual information), between data objects (e.g., CSPA 1 [Strehl and Ghosh 2002]) that induces a similarity measure from partitions and re-clusters objects, or between clusters (e.g., MCLA [Strehl and Ghosh 2002] that collapses groups of clusters into meta-clusters and competes for each object to determine the combined clustering). While the clustering ensemble based on consensus functions largely captures the common structure of the base clusterings, and achieves a combined clustering with better quality than individual clusterings, it also faces several issues that have not been explored well in the consensus design.…”
Section: Process Of Clustering Ensemblementioning
confidence: 99%
“…We thus propose a disparate clustering algorithm to find the design alternatives (as shown in Algorithm I). Multiple clustering can be implemented using different attributes in K-means or by changing the cluster merging rules in hierarchical clustering [20]. We can use a contingency table to capture the relationships between members in BOM clusters and EI clusters.…”
Section: Finding Sustainable Design Alternatives Based On Multiplementioning
confidence: 99%
“…The more two objects or clusters resemble each other, the larger is the similarity [5]. The other similarity between attributes [6] can also be converted into the difference of similarities between pairwise attribute values [7]. Therefore, the similarity between attribute values plays a fundamental role in similarity analysis.…”
Section: Introductionmentioning
confidence: 99%