Proceedings of the Fifteenth ACM Conference on Economics and Computation 2014
DOI: 10.1145/2600057.2602897
|View full text |Cite
|
Sign up to set email alerts
|

Incentivizing exploration

Abstract: We study a Bayesian multi-armed bandit (MAB) setting in which a principal seeks to maximize the sum of expected time-discounted rewards obtained by pulling arms, when the arms are actually pulled by selfish and myopic individuals. Since such individuals pull the arm with highest expected posterior reward (i.e., they always exploit and never explore), the principal must incentivize them to explore by offering suitable payments. Among others, this setting models crowdsourced information discovery and funding age… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
87
0
1

Year Published

2015
2015
2023
2023

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 94 publications
(89 citation statements)
references
References 27 publications
1
87
0
1
Order By: Relevance
“…Our results are related to recent work on incentivizing exploration in a bandit model Frazier et al (2014); Mansour et al ( , 2016. These papers typically model a myopic decision-maker in each round, and an informed non-myopic principle who can influence the decision-maker to explore rather than exploit.…”
Section: Further Related Worksupporting
confidence: 85%
“…Our results are related to recent work on incentivizing exploration in a bandit model Frazier et al (2014); Mansour et al ( , 2016. These papers typically model a myopic decision-maker in each round, and an informed non-myopic principle who can influence the decision-maker to explore rather than exploit.…”
Section: Further Related Worksupporting
confidence: 85%
“…The user can compute the expected cost of travelling along P1 when the recommendation is P2 according to the stationary distribution P πopt of the belief state x. By (11) it is larger than c M , that is,…”
Section: Definition 1 Information Restriction Mechanism (Irm)mentioning
confidence: 99%
“…Frazier et al [7] consider a model with monetary transfers, where the social planner can pay agents to explore. Che and Hörner [3] consider a setting with two binary-valued actions and continuous information flow and a continuum of agents.…”
Section: Related Workmentioning
confidence: 99%
“…The planner can induce explorations in many ways. The simplest is using monetary transfers, paying the agents in order to explore (for example, Frazier et al [7]). We are interested in the case when the social planner is unable or prefers to avoid any monetary transfers.…”
Section: Introductionmentioning
confidence: 99%