Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2023
DOI: 10.1145/3580305.3599473
|View full text |Cite
|
Sign up to set email alerts
|

PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 24 publications
0
2
0
Order By: Relevance
“…These manually generated labels, along with other feedback like comments, are then utilized in a multi-task learning framework to improve recommendations [3]. However, these rule-based methods require notable manual effort and may not consistently align with the operational metrics [34] of the recommender system, such as user engagement. For example, the proposed unbiased watch time label in [37] has been observed to reduce user engagement, evidenced by a reduction in "share" [37].…”
Section: Introductionmentioning
confidence: 99%
“…These manually generated labels, along with other feedback like comments, are then utilized in a multi-task learning framework to improve recommendations [3]. However, these rule-based methods require notable manual effort and may not consistently align with the operational metrics [34] of the recommender system, such as user engagement. For example, the proposed unbiased watch time label in [37] has been observed to reduce user engagement, evidenced by a reduction in "share" [37].…”
Section: Introductionmentioning
confidence: 99%
“…• We collect the first reinforcement learning from human feedback (RLHF) dataset for long-term engagement optimization problem in recommendation and propose three new tasks to evaluate the performance of recommender 1 The work in this chapter has been published as Wanqi Xue, Qingpeng Cai, Zhenghai Xue, Shuo Sun, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An. PrefRec: Recommender systems with human preferences for reinforcing long-term user engagement [131]. Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2023.…”
Section: Guidances Of Human Preferencesmentioning
confidence: 99%