Top-K Off-Policy Correction for a REINFORCE Recommender System

Chen, Minmin; Beutel, Alex; Covington, Paul; Jain, Sahil; Belletti, Francois; Chi, Ed H.

doi:10.1145/3289600.3290999

Cited by 341 publications

(309 citation statements)

References 38 publications

“…Building real-world recommenders face a variety of challenges. Two that relate to the challenges in fairness are the temporal dynamics [33,48,26,9] and biased training data [29,15,3]. These issues do not just make training difficult but also evaluation of recommender performance [42].…”

Section: Related Work (mentioning)

confidence: 99%

“…We consider a production recommender system that is recommending a personalized list of K items to users. We consider a cascading recommender [47,24,16], with a set of retrieval systems [15] followed by a ranking system [16,36]. We assume that the retrieval systems return a set R of M′ relevant items from the total corpus J of M items, where M ≫ M′ ≥ K. The ranking model must then score and rank M′ items in R to get a final ranking of K items.…”

Section: Recommendation Environment (mentioning)

confidence: 99%

Fairness in Recommendation Ranking through Pairwise Comparisons

Beutel, Chen, Doshi, et al. 2019

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Self Cite

Recommender systems are one of the most pervasive applications of machine learning in industry, with many services using them to match users to products or information. As such it is important to ask: what are the possible fairness risks, how can we quantify them, and how should we address them? In this paper we offer a set of novel metrics for evaluating algorithmic fairness concerns in recommender systems. In particular we show how measuring fairness based on pairwise comparisons from randomized experiments provides a tractable means to reason about fairness in rankings from recommender systems. Building on this metric, we offer a new regularizer to encourage improving this metric during model training and thus improve fairness in the resulting rankings. We apply this pairwise regularization to a large-scale, production recommender system and show that we are able to significantly improve the system's pairwise fairness.

“…Other enhancements include incorporating contextual data [5]. Most recently, Chen et al [10] and Ie et al [23] showed success in applying reinforcement learning techniques in YouTube recommender systems. Our work does not deal with designing a recommender system, nor does it attempt to reverse engineer the YouTube recommender.…”

Section: Recommender Systems and Video Recommendation (mentioning)

confidence: 99%

“…Contrasting the extensive literature on evaluating the accuracy of recommendation [5,10,30,54], we focus on prior work that connects network structure with content consumption. Carmi et al [8] reported how the book sales on Amazon react to exogenous demand shocks - not only did the sales increase for the featured item, but the increase also propagated a few hops away by following the links created by the recommender systems.…”

Section: Measuring the Effects Of Recommender Systems (mentioning)

confidence: 99%

“…The first gap measures and estimates the effects of recommender systems in complex social systems. The main goals of recommender systems are maximizing the chance that a user clicks on an item in the next step [4,16,17,48] or in a longer time horizon [5,10,23]. However, recommendation in social systems remains as an open problem for two reasons: (1) a limited conceptual understanding of how finite human attention is allocated over the network of content, in which some items gain popularity at the expense of, or with the assistance of others; (2) the computational challenge of jointly recommending a large collection of items.…”

Section: Introduction (mentioning)

confidence: 99%

Estimating Attention Flow in Online Video Networks

Rizoiu, Xie 2019

Proc. ACM Hum.-Comput. Interact.

Online videos have shown tremendous increase in Internet traffic. Most video hosting sites implement recommender systems, which connect the videos into a directed network and conceptually act as a source of pathways for users to navigate. At present, little is known about how human attention is allocated over such large-scale networks, and about the impacts of the recommender systems. In this paper, we first construct the Vevo network - a YouTube video network with 60,740 music videos interconnected by the recommendation links, and we collect their associated viewing dynamics. This results in a total of 310 million views every day over a period of 9 weeks. Next, we present large-scale measurements that connect the structure of the recommendation network and the video attention dynamics. We use the bow-tie structure to characterize the Vevo network and we find that its core component (23.1% of the videos), which occupies most of the attention (82.6% of the views), is made out of videos that are mainly recommended among themselves. This is indicative of the links between video recommendation and the inequality of attention allocation. Finally, we address the task of estimating the attention flow in the video recommendation network. We propose a model that accounts for the network effects for predicting video popularity, and we show it consistently outperforms the baselines. This model also identifies a group of artists gaining attention because of the recommendation network. Altogether, our observations and our models provide a new set of tools to better understand the impacts of recommender systems on collective social attention.

Value and Impact of Recommender Systems

Jannach, Zanker 2012

Recommender Systems Handbook
