2004
DOI: 10.1145/963770.963772
|View full text |Cite
|
Sign up to set email alerts
|

Evaluating collaborative filtering recommender systems

Abstract: Recommender systems have been evaluated in many, often incomparable, ways. In this article, we review the key decisions in evaluating collaborative filtering recommender systems: the user tasks being evaluated, the types of analysis and datasets being used, the ways in which prediction quality is measured, the evaluation of prediction attributes other than quality, and the user-based evaluation of the system as a whole. In addition to reviewing the evaluation strategies used by prior researchers, we present em… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

14
2,751
5
100

Year Published

2010
2010
2014
2014

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 4,669 publications
(2,992 citation statements)
references
References 20 publications
14
2,751
5
100
Order By: Relevance
“…Users have to move an arrow showing their degree of similarity with the personality characteristics of the characters being presented (these personality characteristics along with examples of typical behaviour patterns are presented under each character's image). In [13] we concluded that it is possible to replace the TKI personality test with the movie metaphor test because it provides an statistically The results that the TKI test provided for the five different personality modes in comparison with the values that the movie metaphor test gathered had a Mean Absolute Error (MAE) [52] of 0.12. Hence, we believe that it is worth sacrificing a little accuracy in the test results (as they are not for psychology testing purposes) in exchange of enhancing significantly the usability and interest for the application.…”
Section: Social Knowledge Elicitation Modulesmentioning
confidence: 99%
“…Users have to move an arrow showing their degree of similarity with the personality characteristics of the characters being presented (these personality characteristics along with examples of typical behaviour patterns are presented under each character's image). In [13] we concluded that it is possible to replace the TKI personality test with the movie metaphor test because it provides an statistically The results that the TKI test provided for the five different personality modes in comparison with the values that the movie metaphor test gathered had a Mean Absolute Error (MAE) [52] of 0.12. Hence, we believe that it is worth sacrificing a little accuracy in the test results (as they are not for psychology testing purposes) in exchange of enhancing significantly the usability and interest for the application.…”
Section: Social Knowledge Elicitation Modulesmentioning
confidence: 99%
“…CBF approaches and CF algorithms have both been used fairly successfully to build recommendation systems in various domains [11][12][13]16]. However, as described above, they suffer from the cold-start problem in one form or another.…”
Section: Related Workmentioning
confidence: 99%
“…Here, we first use Rank Score [15] to measure the ability of a recommendation algorithm to produce a good ordering of items that matches the user's preference. Since real users usually consider only the top part of the recommendation list, two more practical measures are also be employed, namely Precision and Recall [20].…”
Section: Data and Metricsmentioning
confidence: 99%