On the complexity of computing Markov perfect equilibrium in general-sum stochastic games

Deng, Xin; Li, Yuhao; Mguni, David; Wang, Jun; Yang, Yaodong

doi:10.1093/nsr/nwac256

Cited by 13 publications

(3 citation statements)

References 23 publications

“…This design involves a trade-off between the minimization of the communication cost and the quality of control, which becomes less accurate as the transmission rate is reduced, in both the push- and pull-based versions. Using this model, we analyze the advantages and drawbacks of each configuration, proving relevant results and showing that the push-based system, while having better performance at the optimum, is a PPAD-hard problem [15].…”

Section: Introduction (mentioning)

confidence: 84%

“…However, reaching an NE is not a guarantee of Pareto optimality: games may have multiple NEs, and finding the optimal one is PPAD-hard [15]. The push-based approach may be actively harmful, even with respect to an AoI policy.…”

Section: Age and Value Of Information In Effective Communication (mentioning)

confidence: 99%

“…As finding the optimal solution to a Markov game is PPAD-hard [15], no polynomial-time algorithm can reliably find $\pi^*_{B,\text{push}}$. We can then give a counterexample to prove the second part of the theorem: we consider a simple MDP with 5 states and 2 actions, whose evolution is depicted in Fig.…”

Section: Age and Value Of Information In Effective Communication (mentioning)

confidence: 99%

Semantic and Effective Communication for Remote Control Tasks with Dynamic Feature Compression

Talli, Pase, Chiariotti et al. 2023

IEEE INFOCOM 2023 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)

The remote wireless control of industrial systems is one of the major use cases for 5G and beyond systems: in these cases, the massive amounts of sensory information that need to be shared over the wireless medium may overload even high-capacity connections. Consequently, solving the effective communication problem by optimizing the transmission strategy to discard irrelevant information can provide a significant advantage, but is often a very complex task. In this work, we consider a prototypal system in which an observer must communicate its sensory data to a robot controlling a task (e.g., a mobile robot in a factory). We then model it as a remote Partially Observable Markov Decision Process (POMDP), considering the effect of adopting semantic and effective communication-oriented solutions on the overall system performance. We split the communication problem by considering an ensemble Vector Quantized Variational Autoencoder (VQ-VAE) encoding, and train a Deep Reinforcement Learning (DRL) agent to dynamically adapt the quantization level, considering both the current state of the environment and the memory of past messages. We tested the proposed approach on the well-known CartPole reference control problem, obtaining a significant performance increase over traditional approaches.

Dynamic Operational Planning in Warfare: A Stochastic Game Approach to Military Campaigns

McCarthy, Dahan, White 2024

Naval Research Logistics

We study a two‐player discounted zero‐sum stochastic game model for dynamic operational planning in military campaigns. At each stage, the players manage multiple commanders who order military actions on objectives that have an open line of control. When a battle over the control of an objective occurs, its stochastic outcome depends on the actions and the enabling support provided by the control of other objectives. Each player aims to maximize the cumulative number of objectives they control, weighted by their criticality. To solve this large‐scale stochastic game, we derive properties of its Markov perfect equilibria by leveraging the logistics and military operational command and control structure. We show the consequential isotonicity of the optimal value function with respect to the partially ordered state space, which in turn leads to a significant reduction of the state and action spaces. We also accelerate Shapley's value iteration algorithm by eliminating dominated actions and investigating pure equilibria of the matrix game solved at each iteration. We demonstrate the computational value of our equilibrium results on a case study that reflects representative operational‐level military campaigns with geopolitical implications. Our analysis reveals a complex interplay between the game's parameters and dynamics in equilibrium, resulting in new military insights for campaign analysts.

A Generalizable Autonomous Maneuvering Decision-Making Method for UCAV Air Combat Combining PER-D3QN and Zero-Sum Markov Game

Li, Xu, Wang 2024

Lecture Notes in Electrical Engineering
