2020
DOI: 10.48550/arxiv.2002.01197
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Selfish Robustness and Equilibria in Multi-Player Bandits

Abstract: Motivated by cognitive radios, stochastic multi-player multi-armed bandits gained a lot of interest recently. In this class of problems, several players simultaneously pull arms and encounter a collision -with 0 reward -if some of them pull the same arm at the same time. While the cooperative case where players maximize the collective reward (obediently following some fixed protocol) has been mostly considered, robustness to malicious players is a crucial and challenging concern. Existing approaches consider o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 19 publications
0
3
0
Order By: Relevance
“…In this section, we discuss robustness properties when any agent deviates from executing the UCB-D3 algorithm in order to maximize the collected reward. We consider a desirable robustness property called ε Nash Equilibrium for multi-agent algorithms, recently proposed in [8]. Roughly, this property guarantees that no agent can significantly increase (by at-most additive ε) its rewards by unilaterally deviating from the UCB-D3; protocol.…”
Section: Incentive Compatibility and Robustness To Selfish Playersmentioning
confidence: 99%
See 2 more Smart Citations
“…In this section, we discuss robustness properties when any agent deviates from executing the UCB-D3 algorithm in order to maximize the collected reward. We consider a desirable robustness property called ε Nash Equilibrium for multi-agent algorithms, recently proposed in [8]. Roughly, this property guarantees that no agent can significantly increase (by at-most additive ε) its rewards by unilaterally deviating from the UCB-D3; protocol.…”
Section: Incentive Compatibility and Robustness To Selfish Playersmentioning
confidence: 99%
“…Roughly, this property guarantees that no agent can significantly increase (by at-most additive ε) its rewards by unilaterally deviating from the UCB-D3; protocol. Although this is a weaker concept compared to the classical Nash-Equilibrium used in the theory of repeated games [27], is nevertheless a useful property for practical algorithms in multi-agent bandits to posses [8].…”
Section: Incentive Compatibility and Robustness To Selfish Playersmentioning
confidence: 99%
See 1 more Smart Citation