Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery &Amp; Data Mining 2020
DOI: 10.1145/3394486.3403229
|View full text |Cite
|
Sign up to set email alerts
|

Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions

Abstract: Users of music streaming, video streaming, news recommendation, and e-commerce services often engage with content in a sequential manner. Providing and evaluating good sequences of recommendations is therefore a central problem for these services. Prior reweighting-based counterfactual evaluation methods either suffer from high variance or make strong independence assumptions about rewards. We propose a new counterfactual estimator that allows for sequential interactions in the rewards with lower variance in a… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

2
59
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 38 publications
(61 citation statements)
references
References 23 publications
2
59
0
Order By: Relevance
“…For a fair comparison they are therefore ignored when answering RQ2. Time of The Day: We define 5 time windows, where numbers in brackets correspond to the hour range: night (0-5), morning (6)(7)(8)(9)(10)(11), afternoon (12)(13)(14)(15)(16)(17), evening (18)(19)(20)(21)(22)(23), all (0-23). If a session spans across two hours, we round up and consider the whole session as either part of start or end hour.…”
Section: Experimental Settingsmentioning
confidence: 99%
See 1 more Smart Citation
“…For a fair comparison they are therefore ignored when answering RQ2. Time of The Day: We define 5 time windows, where numbers in brackets correspond to the hour range: night (0-5), morning (6)(7)(8)(9)(10)(11), afternoon (12)(13)(14)(15)(16)(17), evening (18)(19)(20)(21)(22)(23), all (0-23). If a session spans across two hours, we round up and consider the whole session as either part of start or end hour.…”
Section: Experimental Settingsmentioning
confidence: 99%
“…Modelling and understanding skipping behaviour in music listening sessions arguably plays a crucial role in understanding user behaviour in modern streaming services. For instance, the skipping signal has already been used as a measure in heuristic-based playlist generation systems [9,25], user satisfaction [16,28], relevance [17], and as counterfactual estimators in Recommender Systems [22].…”
Section: Introductionmentioning
confidence: 99%
“…One way to reduce the variance is introducing a reasonable assumption on user behavior to make the combinatorial item space tractable. However, unrealistically strong assumptions may cause serious bias in OPE [13]. Therefore, achieving a well-balanced bias-variance tradeoff by introducing an appropriate user behavior assumption is the key for enabling accurate OPE of ranking policies.…”
Section: Introductionmentioning
confidence: 99%
“…setting, however, IPS can suffer from large variance, as the item space is combinatorially large [12,13,23]. In contrast, Independent IPS (IIPS) is based on the independence assumption to address the variance issue [12].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation