“…From a broader perspective, our work relates to bandits with complex rewards schemes, i.e., abandonment elements [3,11,44], and non-stationary rewards [4,22,26,27,34,38]. Our work is also related to multi-stakeholder recommendation systems [5,6,9,30,39,45] and fairness in machine learning [12,29,43].…”