2018
DOI: 10.1007/s11634-018-0325-2
|View full text |Cite
|
Sign up to set email alerts
|

Unifying data units and models in (co-)clustering

Abstract: Statisticians are already aware that any modelling process issue (exploration, prediction) is wholly data unit dependent, to the extend that it should be impossible to provide a statistical outcome without specifying the couple (unit,model). In this work, this general principle is formalized with a particular focus in model-based clustering and co-clustering in the case of possibly mixed data types (continuous and/or categorical and/or counting features), being also the opportunity to revisit what the related … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 54 publications
0
2
0
Order By: Relevance
“…Furthermore, as noted in Section 4.1, the way the data is encoded can have a strong impact on the resulting co-clustering partition. Although there are ways to address the matter in some cases, as detailed in [37], the user should be aware of it. Additionally, the influence of each kind of feature on the resulting row partitions is to be investigated more deeply in a future work.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Furthermore, as noted in Section 4.1, the way the data is encoded can have a strong impact on the resulting co-clustering partition. Although there are ways to address the matter in some cases, as detailed in [37], the user should be aware of it. Additionally, the influence of each kind of feature on the resulting row partitions is to be investigated more deeply in a future work.…”
Section: Discussionmentioning
confidence: 99%
“…Consequently they won't be grouped together in a similar column cluster, whereas a simple switch in the order of the levels could change this and lead to grouping these variables together. Note that this problem is not specific to co-clustering and is also present in clustering [37]. While the user should be aware that the results are conditional on the encoding of levels, this is not an issue addressed in this work.…”
Section: Modeling Nominal Datamentioning
confidence: 99%