2021
DOI: 10.48550/arxiv.2101.01301
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Adversarial Combinatorial Bandits with General Non-linear Reward Functions

Xi Chen,
Yanjun Han,
Yining Wang

Abstract: In this paper we study the adversarial combinatorial bandit with a known non-linear reward function, extending existing work on adversarial linear combinatorial bandit. The adversarial combinatorial bandit with general non-linear reward is an important open problem in bandit literature, and it is still unclear whether there is a significant gap from the case of linear reward, stochastic bandit, or semi-bandit feedback. We show that, with N arms and subsets of K arms being chosen at each of T time periods, the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 14 publications
0
1
0
Order By: Relevance
“…Despite the increasing awareness on adaptive arms in CMAB and the significant effort made by the community, CMAB is mostly investigated with adversarial arms [Combes et al, 2015, Wei and Luo, 2018, Sakaue et al, 2018, Chen et al, 2021. This presumes the arms to be stronger than necessary, which leads to implementations of overcautious algorithms that unnecessarily sacrifice utility for their worst-case guarantees.…”
Section: Introductionmentioning
confidence: 99%
“…Despite the increasing awareness on adaptive arms in CMAB and the significant effort made by the community, CMAB is mostly investigated with adversarial arms [Combes et al, 2015, Wei and Luo, 2018, Sakaue et al, 2018, Chen et al, 2021. This presumes the arms to be stronger than necessary, which leads to implementations of overcautious algorithms that unnecessarily sacrifice utility for their worst-case guarantees.…”
Section: Introductionmentioning
confidence: 99%