Proceedings of the 37th International ACM SIGIR Conference on Research &Amp; Development in Information Retrieval 2014
DOI: 10.1145/2600428.2609529
|View full text |Cite
|
Sign up to set email alerts
|

A POMDP model for content-free document re-ranking

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(6 citation statements)
references
References 11 publications
0
6
0
Order By: Relevance
“…As ranking is a key issue in practical recommendation problems, any improvements in ranking contribute significantly to reinforcement recommendation systems. Zhang et al [ 30 ] used a log-based document reranking modeled as a POMDP. Wei et al [ 6 ] proposed a novel LTR model based on a MDP, referred to as MDPRank, which directly optimizes a ranking using a MDP.…”
Section: Related Workmentioning
confidence: 99%
“…As ranking is a key issue in practical recommendation problems, any improvements in ranking contribute significantly to reinforcement recommendation systems. Zhang et al [ 30 ] used a log-based document reranking modeled as a POMDP. Wei et al [ 6 ] proposed a novel LTR model based on a MDP, referred to as MDPRank, which directly optimizes a ranking using a MDP.…”
Section: Related Workmentioning
confidence: 99%
“…Similar to our modeling approach, multi-armed bandits have been utilized to model user preferences by learning diverse rankings for a single query based on clicking behavior [56,88] and learning rankings from pair-wise document comparisons derived from implicit feedback [52,126]. Other similar techniques include POMDPs that have been recently proposed for re-ranking [128] and session search [74].…”
Section: Complex Tasks and Search Personalizationmentioning
confidence: 99%
“…One line is session search. In [11,20], a collaborative search process between the user and the search engine is defined as an MDP. The other line mainly focuses on reinforcement learning to rank.…”
Section: Background 21 Related Workmentioning
confidence: 99%