2007
DOI: 10.1145/1345448.1345461
|View full text |Cite
|
Sign up to set email alerts
|

A classical predictive modeling approach for task "Who rated what?" of the KDD CUP 2007

Abstract: This paper describes one possible way to solve task "Who rated what?" of the KDD CUP 2007. The proposed solution is a history-based model that predicts whether a user will vote a given movie. Key points to our approach are (1) the estimation of the model baseline, (2) the definition of the explanatory variables and (3) the mathematical model form. Given the binary outcome of the problem, the estimation of the true baseline (ratio of 1's in the test data) is critical in order to correctly make predictions. In p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2011
2011
2014
2014

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(9 citation statements)
references
References 3 publications
0
9
0
Order By: Relevance
“…One interesting hybrid method is to improve Top-N recommendations by combining recommendation results from several methods by formulating it as an optimization problem [11]. Logistic Regression is one popular model to optimize the predictions based on more than one predictor [12], [13], [14].…”
Section: A Related Workmentioning
confidence: 99%
“…One interesting hybrid method is to improve Top-N recommendations by combining recommendation results from several methods by formulating it as an optimization problem [11]. Logistic Regression is one popular model to optimize the predictions based on more than one predictor [12], [13], [14].…”
Section: A Related Workmentioning
confidence: 99%
“…Typical example of each type is shown in the right part of the figure. A success recommendation should contain type A B and D. We choose user/item-based Pearson Correlation Coefficient (PCC) [4,29], Aspect Model (AM) [16], PMF [28] and EigenRank [20] as quality-based algorithms; and within relevance-based algorithms, an association-based method [7] and a hitting-frequency-based method [33] are chosen. We randomly choose 40,000 users for training and 10,000 users (given their first 10 ratings) for testing.…”
Section: Quality Vs Relevancementioning
confidence: 99%
“…Within relevance-based algorithms, we choose an associationbased method [7] and a hitting-frequency-based method [33]. Association and hitting frequency have been demonstrated as two competitive features in relevance-based approaches.…”
Section: Rationale Of Basic Approaches Selectionmentioning
confidence: 99%
See 2 more Smart Citations