Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval 2009
DOI: 10.1145/1571941.1572018

Score adjustment for correction of pooling bias

Abstract: Information retrieval systems are evaluated against test collections of topics, documents, and assessments of which documents are relevant to which topics. Documents are chosen for relevance assessment by pooling runs from a set of existing systems. New systems can return unassessed documents, leading to an evaluation bias against them. In this paper, we propose to estimate the degree of bias against an unpooled system, and to adjust the system's score accordingly. Bias estimation can be done via leave-one-out…

Cited by 29 publications (32 citation statements)
References 18 publications

“…Note that we are questioning the reusability of existing test collections: thus, "on-line" methods such as one that monitors the reusability of a test collection while building it [4] is outside the scope of this study. Also, while a score adjustment approach [29] may be useful for evaluating non-contributors accurately, this is also beyond our scope as it requires some new relevance assessments for the non-contributors.…”
Section: Handling Incompleteness; citation type: mentioning
confidence: 99%
“…Weber and Park [1] estimate the bias that the uniform pooling and incomplete judgments introduce when un-judged documents are considered as non-relevant and when they are simply omitted from the computation of the performance scores. For each participating system they consider the discrepancy in a system's performance score when the pool first excludes and then includes documents uniquely retrieved by that system.…”
Section: Related Work; citation type: mentioning
confidence: 99%
“…Weber and Park [1] partially address that issue by considering a more precise error estimation based on a set of common topics for which existing systems and a new one are fully assessed. By removing the uncertainty of the un-judged documents they propose an adjusted estimator that can be extrapolated to new topics and new systems.…”
Section: Related Work; citation type: mentioning
confidence: 99%
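
The leave-one-out idea described in the abstract and in the statements above can be illustrated with a small sketch. It is not the paper's exact estimator: the single topic, the use of precision at k as the metric, the plain averaging of per-system bias, and every name in the code (`precision_at_k`, `leave_one_out_bias`, `adjust_score`, the toy runs and judgments) are assumptions made for illustration only.

```python
from statistics import mean


def precision_at_k(ranking, relevant, k):
    """Fraction of the top-k documents judged relevant.

    Unjudged documents are counted as non-relevant.
    """
    return sum(1 for doc in ranking[:k] if doc in relevant) / k


def leave_one_out_bias(runs, relevant, k, depth):
    """Estimate, for each pooled system, the score it loses when the
    documents only it contributed to the pool lose their judgments.

    runs: dict mapping system name -> ranked list of document ids (one topic).
    relevant: set of document ids judged relevant in the full pool.
    depth: pool depth (top documents taken from each run).
    """
    pools = {name: set(ranking[:depth]) for name, ranking in runs.items()}
    bias = {}
    for name, ranking in runs.items():
        # Documents pooled by any other system.
        others = set().union(*(p for n, p in pools.items() if n != name))
        unique = pools[name] - others
        # Judgments that would exist had this system not been pooled.
        reduced = relevant - unique
        full_score = precision_at_k(ranking, relevant, k)
        loo_score = precision_at_k(ranking, reduced, k)
        bias[name] = full_score - loo_score
    return bias


def adjust_score(raw_score, bias):
    """Add the mean leave-one-out bias to an unpooled system's raw score."""
    return raw_score + mean(bias.values())


# Toy data (hypothetical): three pooled runs and their relevance judgments.
runs = {
    "A": ["d1", "d2", "d3"],
    "B": ["d2", "d4", "d1"],
    "C": ["d5", "d1", "d2"],
}
relevant = {"d1", "d3", "d5"}
bias = leave_one_out_bias(runs, relevant, k=3, depth=3)
adjusted = adjust_score(1 / 3, bias)
```

In this toy case, systems A and C each lose one relevant document when left out of the pool, while B loses none, so the mean bias is 2/9 and a new system with a raw score of 1/3 would be adjusted upward to 5/9.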