2022
DOI: 10.1007/s00357-021-09409-1
|View full text |Cite
|
Sign up to set email alerts
|

Model Selection Strategies for Determining the Optimal Number of Overlapping Clusters in Additive Overlapping Partitional Clustering

Abstract: In various scientific fields, researchers make use of partitioning methods (e.g., K-means) to disclose the structural mechanisms underlying object by variable data. In some instances, however, a grouping of objects into clusters that are allowed to overlap (i.e., assigning objects to multiple clusters) might lead to a better representation of the underlying clustering structure. To obtain an overlapping object clustering from object by variable data, Mirkin’s ADditive PROfile CLUStering (ADPROCLUS) model may b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 107 publications
(155 reference statements)
0
3
0
Order By: Relevance
“…Rossbroich et al (2022) addressed the (underinvestigated) issue of selecting the number of clusters in an overlapping clustering model (ADPROCLUS: Depril et al, 2008; Mirkin, 1987). For this purpose, they proposed and compared 13 model selection strategies, 11 of which were (minor or major) adaptations of similar strategies for partitioning models, and 2 of which were crossvalidation‐based.…”
Section: A Few Illustrative Examplesmentioning
confidence: 99%
See 2 more Smart Citations
“…Rossbroich et al (2022) addressed the (underinvestigated) issue of selecting the number of clusters in an overlapping clustering model (ADPROCLUS: Depril et al, 2008; Mirkin, 1987). For this purpose, they proposed and compared 13 model selection strategies, 11 of which were (minor or major) adaptations of similar strategies for partitioning models, and 2 of which were crossvalidation‐based.…”
Section: A Few Illustrative Examplesmentioning
confidence: 99%
“…For the actual simulations, either own code or existing data generators can be used. Regarding the latter, over the past decades quite a few generators have been proposed, including the Milligan (1985) algorithm for generating artificial test clusters, OCLUS (Steinley & Henson, 2005), the Qiu and Joe (2006) random cluster generation algorithm, and MixSim (Melnykov et al, 2012). In the justification of these generators, quite some emphasis has been put on the aspects of separability and overlap, where overlap refers to intensional rather than to extensional overlap, that is to say, overlap in terms of variables or component distributions, with all generated clusterings being partitions.…”
Section: Issuesmentioning
confidence: 99%
See 1 more Smart Citation