Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries 2016
DOI: 10.1145/2910896.2910907

Querylog-based Assessment of Retrievability Bias in a Large Newspaper Corpus

Abstract: Bias in the retrieval of documents can directly influence the information access of a digital library. In the worst case, systematic favoritism for a certain type of document can render other parts of the collection invisible to users. This potential bias can be evaluated by measuring the retrievability for all documents in a collection. Previous evaluations have been performed on TREC collections using simulated query sets. The question remains, however, how representative this approach is of more realistic s…
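As a minimal sketch of what measuring retrievability involves, the snippet below uses the cumulative measure common in the retrievability literature: a document's score r(d) counts the queries for which it appears in the top c results. The function name, the cutoff parameter, and the toy rankings are illustrative assumptions and are not taken from the paper.

```python
from collections import Counter


def retrievability(rankings, cutoff):
    """Cumulative retrievability: r(d) = number of queries ranking d in the top `cutoff`.

    `rankings` maps a query string to its ranked list of document ids (best first).
    """
    scores = Counter()
    for ranked_docs in rankings.values():
        # set() guards against a document id appearing twice in one ranking
        for doc_id in set(ranked_docs[:cutoff]):
            scores[doc_id] += 1
    return scores


# Hypothetical toy data, for illustration only.
toy_rankings = {
    "q1": ["d1", "d2", "d3"],
    "q2": ["d1", "d3", "d4"],
    "q3": ["d2", "d1", "d5"],
}
toy_scores = retrievability(toy_rankings, cutoff=2)
```

Documents that never reach the top c for any query simply have no entry in the counter, so looking them up yields 0.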

Citations: cited by 20 publications (28 citation statements)
References: 14 publications
“…BM25 induces the smallest inequality for both query sets and can therefore be considered to be the fairest model. This is in line with the findings of [7,43]. For each setup a number of documents in the collection are never retrieved by any retrieval model (r (d) = 0).…”
Section: Retrievability Bias (supporting)
confidence: 88%
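The inequality referred to in this statement is typically summarized in retrievability studies with the Gini coefficient over all r(d) values, where 0 means every document is equally retrievable and values near 1 mean a few documents dominate. A minimal sketch, assuming the scores are already available as a list; the function names and numbers are illustrative only and do not come from the cited work.

```python
def gini(values):
    """Gini coefficient of non-negative values (0 = perfect equality)."""
    xs = sorted(values)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        return 0.0
    # Standard formula over ascending values with 1-based rank i:
    # G = sum((2*i - n - 1) * x_i) / (n * sum(x))
    weighted = sum((2 * i - n - 1) * x for i, x in enumerate(xs, start=1))
    return weighted / (n * total)


def never_retrieved(values):
    """Number of documents with r(d) = 0."""
    return sum(1 for v in values if v == 0)


toy_r = [0, 0, 1, 2, 3]  # hypothetical r(d) values
toy_gini = gini(toy_r)
toy_zero = never_retrieved(toy_r)
```

Comparing this coefficient across retrieval models on the same query set is one way to rank them by how evenly they expose the collection.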
“…Information retrieval researchers have coined the term retrievability, which refers to how accessible a document is in a system [4]. When a system systematically favors documents with particular characteristics, such that their retrievability is higher than that of others, the system exhibits a retrievability bias [44]. In a search for "person" images, it is clear that photos of men have significantly greater retrievability as compared to photos of women.…”
Section: Analysis Who Represents a "Person"? (mentioning)
confidence: 99%
“…In [7], different methods of retrieval have been evaluated for their performance under standalone and combined way. Toward corpus retrieval of newspapers, an log based approach is presented in [8]. The retrieve ability measure is computed and evaluated towards performance.…”
Section: Literature Review (mentioning)
confidence: 99%