2023
DOI: 10.48550/arxiv.2303.03327
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Lower Bounds for $γ$-Regret via the Decision-Estimation Coefficient

Abstract: In this note, we give a new lower bound for the γ-regret in bandit problems, the regret which arises when comparing against a benchmark that is γ times the optimal solution, i.e., Reg γ (T ) = T t=1 γ maxπ f (π) − f (πt). The γ-regret arises in structured bandit problems where finding an exact optimum of f is intractable. Our lower bound is given in terms of a modification of the constrained Decision-Estimation Coefficient (DEC) of Foster et al. [2023] (and closely related to the original offset DEC of Foster … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 5 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?