A Stochastic Treatment of Learning to Rank Scoring Functions

Bruch, Sebastian; Han, Shuguang; Bendersky, Michael; Najork, Marc

doi:10.1145/3336191.3371844

Cited by 55 publications

(50 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Recently, Bruch et al [4] demonstrate that learning to rank models can be optimized towards expected values of relevance metrics computed over multiple rankings sampled based on estimated relevance. While not developed in the context of deploying a stochastic ranker, we adopt some of the methodologies therein in our experiments.…”

Section: Stochastic Rankingmentioning

confidence: 99%

“…Unfortunately, this sampling process is non-differentiable and, therefore, prohibitive to a large class of models, including those that learn by gradient descent. We address this by adopting the method proposed by Bruch et al [4]. To construct a sampled ranking , we reparameterize the probability distribution by adding independently drawn noise samples from the Gumbel distribution [23] to y and sorting items by the "noisy" probability distributioñ ,˜(…”

Section: Algorithmmentioning

confidence: 99%

See 1 more Smart Citation

Evaluating Stochastic Rankings with Expected Exposure

Dı́az

Mitra

Ekstrand

et al. 2020

Proceedings of the 29th ACM International Conference on Information &Amp; Knowledge Management

131

140

View full text Add to dashboard Cite

We introduce the concept of expected exposure as the average attention ranked items receive from users over repeated samples of the same query. Furthermore, we advocate for the adoption of the principle of equal expected exposure: given a fixed information need, no item should receive more or less expected exposure than any other item of the same relevance grade. We argue that this principle is desirable for many retrieval objectives and scenarios, including topical diversity and fair ranking. Leveraging user models from existing retrieval metrics, we propose a general evaluation methodology based on expected exposure and draw connections to related metrics in information retrieval evaluation. Importantly, this methodology relaxes classic information retrieval assumptions, allowing a system, in response to a query, to produce a distribution over rankings instead of a single fixed ranking. We study the behavior of the expected exposure metric and stochastic rankers across a variety of information access conditions, including ad hoc retrieval and recommendation. We believe that measuring and optimizing expected exposure metrics using randomization opens a new area for retrieval algorithm development and progress. CCS CONCEPTS • Information systems → Evaluation of retrieval results; Learning to rank.

show abstract

Section: Stochastic Rankingmentioning

confidence: 99%

Section: Algorithmmentioning

confidence: 99%

Evaluating Stochastic Rankings with Expected Exposure

Dı́az

Mitra

Ekstrand

et al. 2020

Proceedings of the 29th ACM International Conference on Information &Amp; Knowledge Management

131

140

View full text Add to dashboard Cite

show abstract

“…It is noteworthy, that enumerating all distinct document permutations can be computationally challenging even for a moderately sized set of candidates. More recently, Bruch et al [418] demonstrated a mechanism for sampling rankings from the Plackett-Luce distribution using the reparameterization trick [395] that is amenable to gradient-based optimization. Their approach involves adding independently drawn noise samples from the Gumbel distribution [419] and then deriving the approximate rank of the document following the method proposed by Qin et al [420] and Wu et al [421].…”

Section: Related Workmentioning

confidence: 99%

Neural methods for effective, efficient, and exposure-aware information retrieval

Mitra

2021

SIGIR Forum

View full text Add to dashboard Cite

Neural networks with deep architectures have demonstrated significant performance improvements in computer vision, speech recognition, and natural language processing. The challenges in information retrieval (IR), however, are different from these other application areas. A common form of IR involves ranking of documents---or short passages---in response to keyword-based queries. Effective IR systems must deal with query-document vocabulary mismatch problem, by modeling relationships between different query and document terms and how they indicate relevance. Models should also consider lexical matches when the query contains rare terms---such as a person's name or a product model number---not seen during training, and to avoid retrieving semantically related but irrelevant results. In many real-life IR tasks, the retrieval involves extremely large collections---such as the document index of a commercial Web search engine---containing billions of documents. Efficient IR methods should take advantage of specialized IR data structures, such as inverted index, to efficiently retrieve from large collections. Given an information need, the IR system also mediates how much exposure an information artifact receives by deciding whether it should be displayed, and where it should be positioned, among other results. Exposure-aware IR systems may optimize for additional objectives, besides relevance, such as parity of exposure for retrieved items and content publishers. In this thesis, we present novel neural architectures and methods motivated by the specific needs and challenges of IR tasks. We ground our contributions with a detailed survey of the growing body of neural IR literature [Mitra and Craswell, 2018]. Our key contribution towards improving the effectiveness of deep ranking models is developing the Duet principle [Mitra et al., 2017] which emphasizes the importance of incorporating evidence based on both patterns of exact term matches and similarities between learned latent representations of query and document. To efficiently retrieve from large collections, we develop a framework to incorporate query term independence [Mitra et al., 2019] into any arbitrary deep model that enables large-scale precomputation and the use of inverted index for fast retrieval. In the context of stochastic ranking, we further develop optimization strategies for exposure-based objectives [Diaz et al., 2020]. Finally, this dissertation also summarizes our contributions towards benchmarking neural IR models in the presence of large training datasets [Craswell et al., 2019] and explores the application of neural methods to other IR tasks, such as query auto-completion.

show abstract

“…For our experiments, we use two public learning-to-rank datasets with numerical features, and two large-scale proprietary datasets. 3 We discard queries with no relevant documents, similar to evaluation in [4]. ick Access.…”

Section: Experiments 51 Datasetsmentioning

confidence: 99%

“…On the public datasets, we compare attn-DIN with the RankLib 4 and LightGBM [10] implementations of LambdaMART, and state-of-theart neural ranking algorithms: SetRank [14], Deep Listwise Context Model (DLCM) [1], Groupwise Scoring Functions (GSF) [2], and Feed-Forward Neural Network (with ReLU activations) with Gumbel Approximate NDCG loss [3]. We tune the hyperparameters of LightGBM and set both the number of iteration and the number of leaves to be 2,000 for WEB30K and 500 for Istella.…”

Section: Baselinesmentioning

confidence: 99%

Permutation Equivariant Document Interaction Network for Neural Learning to Rank

Pasumarthi

Zhuang

Wang

et al. 2020

Proceedings of the 2020 ACM SIGIR on International Conference on Theory of Information Retrieval

Self Cite

View full text Add to dashboard Cite

How to leverage cross-document interactions to improve ranking performance is an important topic in information retrieval research. The recent developments in deep learning show strength in modeling complex relationships across sequences and sets. It thus motivates us to study how to leverage cross-document interactions for learning-to-rank in the deep learning framework. In this paper, we formally de ne the permutation equivariance requirement for a scoring function that captures cross-document interactions. We then propose a self-attention based document interaction network that extends any univariate scoring function with contextual features capturing cross-document interactions. We show that it satis es the permutation equivariance requirement, and can generate scores for document sets of varying sizes. Our proposed methods can automatically learn to capture document interactions without any auxiliary information, and can scale across large document sets. We conduct experiments on four ranking datasets: the public benchmarks WEB30K and Istella, as well as Gmail search and Google Drive Quick Access datasets. Experimental results show that our proposed methods lead to signi cant quality improvements over state-of-the-art neural ranking models, and are competitive with state-of-the-art gradient boosted decision tree (GBDT) based models on the WEB30K dataset. CCS CONCEPTS • Information systems → Learning to rank.

show abstract

A Stochastic Treatment of Learning to Rank Scoring Functions

Cited by 55 publications

References 30 publications

Evaluating Stochastic Rankings with Expected Exposure

Evaluating Stochastic Rankings with Expected Exposure

Neural methods for effective, efficient, and exposure-aware information retrieval

Permutation Equivariant Document Interaction Network for Neural Learning to Rank

Contact Info

Product

Resources

About