Proceedings of the Sixth ACM International Conference on Web Search and Data Mining 2013
DOI: 10.1145/2433396.2433419

Reusing historical interaction data for faster online learning to rank for IR

Abstract: We summarize the findings from Hofmann et al. [6]. Online learning to rank for information retrieval (IR) holds promise for allowing the development of "self-learning" search engines that can automatically adjust to their users. With the large amounts of, e.g., click data that can be collected in web search settings, such techniques could enable highly scalable ranking optimization. However, feedback obtained from user interactions is noisy, and developing approaches that can learn from this feedback quickly and…

Cited by 94 publications (109 citation statements); references 24 publications.
Citation types: 0 supporting, 109 mentioning, 0 contrasting.
Citing publications span 2016 to 2024.

Citation statements (ordered by relevance):
“…The advantage of P-MGD is that it can learn faster by having n, the number of candidates, in Algorithm 1 exceed the length of the result list. Candidate Preselection (CPS) [5], unlike MGD, does not alter the number of candidates compared per impression. It speeds up learning by reusing historical data to select…”
Section: Online Learning To Rank Methods (mentioning)
confidence: 99%
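
To make the quoted contrast concrete, the following sketch shows candidate preselection in the spirit of CPS [5]: many candidate rankers are generated, but historical interaction data is used to estimate which one is most promising before a single live comparison. This is a minimal illustration, not the authors' implementation; the estimate_win_rate callback is hypothetical shorthand for an offline comparison on logged data (e.g., via probabilistic interleaving), not a real API.

    import numpy as np

    def preselect_candidate(w, history, estimate_win_rate,
                            pool_size=6, delta=1.0):
        """Pick the most promising candidate ranker from logged data (sketch).

        w                 -- weights of the current linear ranker
        history           -- logged (query, shown list, clicks) records
        estimate_win_rate -- hypothetical callback estimating, from history
                             alone, how often a candidate would beat w
        """
        best_cand, best_score = None, float("-inf")
        for _ in range(pool_size):
            # Sample a uniformly random unit direction and perturb w.
            u = np.random.randn(len(w))
            u /= np.linalg.norm(u)
            cand = w + delta * u
            score = estimate_win_rate(w, cand, history)  # offline estimate
            if score > best_score:
                best_cand, best_score = cand, score
        # Only the preselected candidate enters the live interleaved
        # comparison, so the number of candidates compared per impression
        # stays at two, as the quoted statement notes.
        return best_cand
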
“…We show experimentally that P-MGD significantly outperforms state-of-the-art online learning to rank methods in terms of online performance, without sacrificing offline performance, and at greater learning speed than those methods. In particular, we include comparisons between P-MGD on the one hand and multiple types of DBGD and multileaved gradient descent methods [14, MGD] and candidate preselection [5, CPS] on the other. We answer the following research questions: (RQ1) Does P-MGD converge on a ranker of the same quality as MGD and CPS?…”
Section: Introduction (mentioning)
confidence: 99%
“…DBGD implements a stochastic gradient descent method to find the best ranker in an infinite space of rankers. This algorithm was previously extended by Hofmann et al. [11] so that it reuses historical interaction data rather than only live user interactions. Alternative methods are based on the k-armed bandit formulation of [32] and assume a finite space of possible rankers, the best of which needs to be found.…”
Section: Online Learning To Rank (mentioning)
confidence: 99%
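
As a point of reference for the DBGD loop described in this statement, here is a minimal sketch of one dueling bandit gradient descent step over a linear ranker. It is an illustration, not the implementation from [11]; the compare callback, which would run a live interleaved comparison on a user query, is an assumed stand-in.

    import numpy as np

    def dbgd_step(w, compare, delta=1.0, alpha=0.1):
        """One dueling bandit gradient descent step (sketch).

        w       -- weight vector of the current linear ranker
        compare -- assumed callback: compare(w, w_cand) returns True if the
                   candidate wins an interleaved comparison on a live query
        delta   -- exploration step size
        alpha   -- learning rate
        """
        # Probe a uniformly random direction on the unit sphere.
        u = np.random.randn(len(w))
        u /= np.linalg.norm(u)
        w_cand = w + delta * u
        # Move toward the candidate only if users preferred its ranking.
        return w + alpha * u if compare(w, w_cand) else w
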
“…A major advantage of PI is that it can also infer preferences between rankers that were not originally interleaved. This allows one to learn from historical interaction data [11]. However, PI risks showing users poor rankings.…”
Section: Evaluation For Information Retrieval (mentioning)
confidence: 99%
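
The reuse of historical data enabled by PI can be illustrated with a small importance-sampling sketch. Assume, purely for illustration, that each logged record stores the displayed list, an outcome (positive if the candidate side won the clicks), and the probability of showing that list under the logging distribution; PI assigns nonzero probability to every list, which is what makes the reweighting below well defined. This is a simplified stand-in for the estimator in [11], not its actual form.

    def estimate_historical_preference(target_prob, records):
        """Importance-sampled preference estimate for a new ranker pair (sketch).

        target_prob -- function: probability that the target ranker pair's
                       interleaving distribution would show a logged list
        records     -- iterable of (shown_list, outcome, logging_prob) tuples
                       (schema assumed here for illustration)
        """
        weighted_sum, weight_total = 0.0, 0.0
        for shown_list, outcome, logging_prob in records:
            w = target_prob(shown_list) / logging_prob  # importance weight
            weighted_sum += w * outcome
            weight_total += w
        # A positive estimate suggests the candidate ranker is preferred
        # according to the logged interactions alone.
        return weighted_sum / weight_total if weight_total > 0 else 0.0
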
“…• More advanced methods that build on DBGD, such as Probabilistic Multileave Gradient Descent [23, 27] and DBGD with Candidate Preselection [12].…”
Section: Part I [10 Minutes] Introduction, Aims, and Historical Notes (mentioning)
confidence: 99%