2021
DOI: 10.48550/arxiv.2103.02741
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Combinatorial Bandits without Total Order for Arms

Shuo Yang,
Tongzheng Ren,
Inderjit S. Dhillon
et al.

Abstract: We consider the combinatorial bandits problem, where at each time step, the online learner selects a size-k subset s from the arms set A, where |A| = n, and observes a stochastic reward of each arm in the selected set s. The goal of the online learner is to minimize the regret, induced by not selecting s * which maximizes the expected total reward. Specifically, we focus on a challenging setting where 1) the reward distribution of an arm depends on the set s it is part of, and crucially 2) there is no total or… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 13 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?