2022
DOI: 10.48550/arxiv.2205.13746
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games

Abstract: We study the problem of finding the Nash equilibrium in a two-player zero-sum Markov game. Due to its formulation as a minimax optimization program, a natural approach to solve the problem is to perform gradient descent/ascent with respect to each player in an alternating fashion. However, due to the non-convexity/non-concavity of the underlying objective function, theoretical understandings of this method are limited. In our paper, we consider solving an entropy-regularized variant of the Markov game. The reg… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 12 publications
0
2
0
Order By: Relevance
“…By choosing small regularization weights, their method can find an -Nash equilibrium in O(1/ ) iterations. Zeng et al (2022) also consider adding entropy regularization to help find Nash equilibria in zero-sum Markov games. They prove the O(t −1/3 ) convergence rate of a variant of GDA by driving regularization weights dynamically to zero.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…By choosing small regularization weights, their method can find an -Nash equilibrium in O(1/ ) iterations. Zeng et al (2022) also consider adding entropy regularization to help find Nash equilibria in zero-sum Markov games. They prove the O(t −1/3 ) convergence rate of a variant of GDA by driving regularization weights dynamically to zero.…”
Section: Related Workmentioning
confidence: 99%
“…However, due to the nonconvexity-nonconcavity, theoretical understanding of zero-sum Markov games is sparser. Existing methods have either sublinear rates for finding Nash equilibria, or linear rates for finding regularized Nash equiliria such as quantal response equilibria which are approximations for Nash equilibria Alacaoglu et al (2022); Cen et al (2021); Daskalakis et al (2020); Pattathil et al (2022); Perolat et al (2015); Wei et al (2021); Yang and Ma (2022); Zeng et al (2022); Zhang et al (2022); Zhao et al (2022). A natural question is:…”
Section: Introductionmentioning
confidence: 99%