2022
DOI: 10.1093/nsr/nwac256
|View full text |Cite
|
Sign up to set email alerts
|

On the complexity of computing Markov perfect equilibrium in general-sum stochastic games

Abstract: Similar to the role of Markov decision processes in reinforcement learning, Markov games (also called stochastic games) lay down the foundation for the study of multi-agent reinforcement learning and sequential agent interactions. We introduce approximate Markov perfect equilibrium as a solution to the computational problem of finite-state stochastic games repeated in the infinite horizon and prove its PPAD-completeness. This solution concept preserves the Markov perfect property and opens up the possibility f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 13 publications
(3 citation statements)
references
References 23 publications
0
3
0
Order By: Relevance
“…This design involves a trade-off between the minimization of the communication cost and the quality of control, which becomes less accurate as the transmission rate is reduced, in both the push-and pull-based versions. Using this model, we analyze the advantages and drawbacks of each configuration, proving relevant results and showing that the push-based system, while having better performance at the optimum, is a PPAD-hard problem [15].…”
Section: Introductionmentioning
confidence: 84%
See 2 more Smart Citations
“…This design involves a trade-off between the minimization of the communication cost and the quality of control, which becomes less accurate as the transmission rate is reduced, in both the push-and pull-based versions. Using this model, we analyze the advantages and drawbacks of each configuration, proving relevant results and showing that the push-based system, while having better performance at the optimum, is a PPAD-hard problem [15].…”
Section: Introductionmentioning
confidence: 84%
“…However, reaching an NE is not a guarantee of Pareto optimality: games may have multiple NEs, and finding the optimal one is PPAD-hard [15]. The push-based approach may be actively harmful, even with respect to an AoI policy.…”
Section: Age and Value Of Information In Effective Communicationmentioning
confidence: 99%
See 1 more Smart Citation