2016
DOI: 10.48550/arxiv.1602.07182
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Explore First, Exploit Next: The True Shape of Regret in Bandit Problems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
19
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(19 citation statements)
references
References 0 publications
0
19
0
Order By: Relevance
“…Perhaps the most important remaining problem for the subgaussian noise model is the question of lower bounds. Besides the asymptotic results by Lai and Robbins [1985] and Burnetas and Katehakis [1997] there has been some recent progress on finite-time lower bounds, both in the OCUCB paper and a recent article by Garivier et al [2016]. Some further progress is made in Appendix A, but still there are regimes where the bounds are not very precise.…”
Section: Discussionmentioning
confidence: 99%
“…Perhaps the most important remaining problem for the subgaussian noise model is the question of lower bounds. Besides the asymptotic results by Lai and Robbins [1985] and Burnetas and Katehakis [1997] there has been some recent progress on finite-time lower bounds, both in the OCUCB paper and a recent article by Garivier et al [2016]. Some further progress is made in Appendix A, but still there are regimes where the bounds are not very precise.…”
Section: Discussionmentioning
confidence: 99%
“…using the fact that the binary KL-divergence satisfies kl(x, y) = kl(1 − x, 1 − y) as well as the inequality kl(x, y) ≥ x log (1 y) − log(2), proved by Garivier et al (2016). Now, using Markov inequality yields…”
Section: A3 Proof Of Lemmamentioning
confidence: 98%
“…λ), given a fixed algorithm. Using the exact same technique as Garivier et al (2016) (the contraction of entropy principle), one can establish that for any event A that is σ(I t )-measurable 7 , KL P It µ , P It λ ≥ kl (P µ (A), P λ (A)) .…”
Section: B2 Proof Of Lemma 12mentioning
confidence: 99%
See 2 more Smart Citations