Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence 2021
DOI: 10.24963/ijcai.2021/645
|View full text |Cite
|
Sign up to set email alerts
|

Decentralized No-regret Learning Algorithms for Extensive-form Correlated Equilibria (Extended Abstract)

Abstract: The existence of uncoupled no-regret learning dynamics converging to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium. Extensive-form games generalize normal-form games by modeling both sequential and simultaneous moves, as well … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
24
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
5

Relationship

3
7

Authors

Journals

citations
Cited by 14 publications
(24 citation statements)
references
References 25 publications
0
24
0
Order By: Relevance
“…Extensive-form games Finally, we remark that there is another long line of research on MARL based on the model of extensive-form games (EFG) [see, e.g., 28,19,65,9,10,12]. Results on learning EFGs do not directly imply results for learning MGs, since EFGs are naturally treestructured games, which can not efficiently represent MGs-graph-structured games where a state at the h th step can be the child of multiple states at the (h − 1) th step.…”
Section: Introductionmentioning
confidence: 99%
“…Extensive-form games Finally, we remark that there is another long line of research on MARL based on the model of extensive-form games (EFG) [see, e.g., 28,19,65,9,10,12]. Results on learning EFGs do not directly imply results for learning MGs, since EFGs are naturally treestructured games, which can not efficiently represent MGs-graph-structured games where a state at the h th step can be the child of multiple states at the (h − 1) th step.…”
Section: Introductionmentioning
confidence: 99%
“…For instance, we have that 135 = 145 , and 145 = 145 . We are now ready to define the concept of trigger regret minimization, which extends and generalizes the homonymous notion in the conference version of this paper [9], as well as the notion of internal regret minimization in normal-form games. Definition 3.6.…”
Section: Arxiv Preprintmentioning
confidence: 99%
“…Moreover, given that no-external regret learning dynamics converge to the set of coarse correlated equilibria (see [8,20]), Blum et al [9] study the price of total anarchy in games in which players take decisions so as to minimize their external regret. No-regret learning dynamics converging to correlated equilibria have been studied in various settings (see, e.g., [36,42,18,19,32]). Moreover, Hartline et al [43] and Caragiannis et al [15] study the quality of outcomes emerging from no-regret dynamics in Bayesian settings.…”
Section: Related Workmentioning
confidence: 99%