Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2007
DOI: 10.1145/1281192.1281222
|View full text |Cite
|
Sign up to set email alerts
|

A framework for simultaneous co-clustering and learning from complex data

Abstract: For difficult classification or regression problems, practitioners often segment the data into relatively homogenous groups and then build a model for each group. This two-step procedure usually results in simpler, more interpretable and actionable models without any loss in accuracy. We consider problems such as predicting customer behavior across products, where the independent variables can be naturally partitioned into two groups. A pivoting operation can now result in the dependent variable showing up as … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
82
0
3

Year Published

2009
2009
2023
2023

Publication Types

Select...
3
3
3

Relationship

2
7

Authors

Journals

citations
Cited by 58 publications
(85 citation statements)
references
References 22 publications
0
82
0
3
Order By: Relevance
“…Deodhar and Ghosh (2007) 1 stated that researchers most often do partitioning a priori based on domain knowledge or a separate segmentation routine.…”
Section: Introductionmentioning
confidence: 99%
“…Deodhar and Ghosh (2007) 1 stated that researchers most often do partitioning a priori based on domain knowledge or a separate segmentation routine.…”
Section: Introductionmentioning
confidence: 99%
“…Two of these approaches are SCOAL (Simultaneous Co-clustering and Learning) [8] and PDLF (Predictive Discrete Latent Factor Modeling) [4]. Both approaches partition the users and items into a grid of blocks (co-clusters) of related users and items, while simultaneously learning a predictive model on each co-cluster.…”
Section: Related Workmentioning
confidence: 99%
“…The organic emergence of these predictive models is coupled with the formation of the co-clusters that define the domain of the models. Such coupling of the models and co-clusters improves both the interpretability and the accuracy when modeling predictively heterogeneous dyadic datasets, as this mechanism can effectively exploit both local neighborhood patterns as well as the globally available attributes [8,4].…”
Section: Related Workmentioning
confidence: 99%
“…Clustering on such data is useful in many applications including product recommendations, customer and product segmentation, and identifying various customer and market trends. However, typically only a small subset of customers show statistically significant coherent buying behavior and that too when one focuses only a small subset of products [Strehl and Ghosh 2003;Deodhar and Ghosh 2007;Wedel and Steenkamp 1991]. Therefore, a clustering algorithm for such datasets should have the ability to prune out (potentially large) sparse and noisy portions of the data to uncover the highly coherent clusters.…”
Section: Introductionmentioning
confidence: 99%