2023
DOI: 10.48550/arxiv.2302.12202
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Definition of Non-Stationary Bandits

Abstract: The subject of non-stationary bandit learning has attracted much recent attention. However, nonstationary bandits lack a formal definition. Loosely speaking, non-stationary bandits have typically been characterized in the literature as those for which the reward distribution changes over time. We demonstrate that this informal definition is ambiguous. Further, a widely-used notion of regret-the dynamic regret-is motivated by this ambiguous definition and thus problematic. In particular, even for an optimal age… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 19 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?