2019
DOI: 10.48550/arxiv.1911.09458
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Observe Before Play: Multi-armed Bandit with Pre-observations

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 16 publications
0
2
0
Order By: Relevance
“…In a recent work [10], the authors study a bandit problem with Bernoulli rewards where in each round, the decision maker proposes an ordered list of the K arms, and plays the first arm with observed reward equal to 1. This problem is similar to that investigated in [11].…”
Section: Stochastic Bandit Problems Have Been Extensively Studiedmentioning
confidence: 99%
See 1 more Smart Citation

Predictive Bandits

Lindståhl,
Proutiere,
Johnsson
2020
Preprint
“…In a recent work [10], the authors study a bandit problem with Bernoulli rewards where in each round, the decision maker proposes an ordered list of the K arms, and plays the first arm with observed reward equal to 1. This problem is similar to that investigated in [11].…”
Section: Stochastic Bandit Problems Have Been Extensively Studiedmentioning
confidence: 99%
“…). (10) This implies that an asymptotic lower bound for the regret is C(θ) log(T ), where C(θ) is the value of the solution of the following optimization problem.…”
Section: Appendix I Proof Of Theoremmentioning
confidence: 99%

Predictive Bandits

Lindståhl,
Proutiere,
Johnsson
2020
Preprint