2021
DOI: 10.48550/arxiv.2104.01520
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Simple Uncoupled No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

Abstract: The existence of simple, uncoupled no-regret dynamics that converge to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium. Extensive-form (that is, tree-form) games generalize normal-form games by modeling both sequential and simult… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
14
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(14 citation statements)
references
References 30 publications
0
14
0
Order By: Relevance
“…Later, Farina et al (2019) propose a min-max optimization formulation of EFCEs which can be solved by first-order methods. Celli et al (2020) and its extended version (Farina et al, 2021a) design the first uncoupled no-regret algorithm for computing EFCEs. Their algorithms are based on minimizing the trigger regret (first considered in Dudik and Gordon (2012); Gordon et al (2008)) via counterfactual regret decomposition (Zinkevich et al, 2007).…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Later, Farina et al (2019) propose a min-max optimization formulation of EFCEs which can be solved by first-order methods. Celli et al (2020) and its extended version (Farina et al, 2021a) design the first uncoupled no-regret algorithm for computing EFCEs. Their algorithms are based on minimizing the trigger regret (first considered in Dudik and Gordon (2012); Gordon et al (2008)) via counterfactual regret decomposition (Zinkevich et al, 2007).…”
Section: Related Workmentioning
confidence: 99%
“…The iteration complexity for computing an ε-approximate correlated equilibrium in both (Farina et al, 2021a;Morrill et al, 2021) scales quadratically in max i∈[m] X i . Our K-EFR algorithm for the full feedback setting builds upon the EFR algorithm, but specializes to the notion of K-EFCE, and achieve an improved linear in max i∈[m] X i iteration complexity.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Moreover, given that no-external regret learning dynamics converge to the set of coarse correlated equilibria (see [8,20]), Blum et al [9] study the price of total anarchy in games in which players take decisions so as to minimize their external regret. No-regret learning dynamics converging to correlated equilibria have been studied in various settings (see, e.g., [36,42,18,19,32]). Moreover, Hartline et al [43] and Caragiannis et al [15] study the quality of outcomes emerging from no-regret dynamics in Bayesian settings.…”
Section: Related Workmentioning
confidence: 99%
“…The problem of computing one EFCE (and, therefore, one NFCCE/EFCCE) can be solved in polynomial time in the size of the game tree [Huang and von Stengel, 2008] via a variation of the Ellipsoid Against Hope algorithm [Jiang andLeyton-Brown, 2015, Papadimitriou andRoughgarden, 2008]. Moreover, there exist decentralized no-regret learning dynamics guaranteeing that the empirical frequency of play after 𝑇 rounds is an 𝑂 (1/ √ 𝑇 )-approximate EFCE with high probability, and an EFCE almost surely in the limit [Celli et al, 2020, Farina et al, 2021b. Furthermore, NFCCE is the equilibrium notion that gets satisfied when all players in a game play according to any regret minimizer.…”
Section: Introductionmentioning
confidence: 99%