Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence 2019
DOI: 10.24963/ijcai.2019/50
|View full text |Cite
|
Sign up to set email alerts
|

Compact Representation of Value Function in Partially Observable Stochastic Games

Abstract: Value methods for solving stochastic games with partial observability model the uncertainty about states of the game as a probability distribution over possible states. The dimension of this belief space is the number of states. For many practical problems, for example in security, there are exponentially many possible states which causes an insufficient scalability of algorithms for real-world problems. To this end, we propose an abstraction technique that addresses this issue of the curse of dimensionality b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(4 citation statements)
references
References 2 publications
0
4
0
Order By: Relevance
“…Direct methods such as dynamic programming are intractable in practice [13]. We list the following approaches that can compute policies on small POSGs: using a memory bounded representation of the value function [11]; heuristically reformulating as a Bayesian game [26]; performing value iteration and applying beliefbased heuristics [16]; and leveraging marginal probabilities to compactly represent the belief spaces [18] . While these general approaches rely on fewer assumptions than the proposed, dedicated, method, they are only tractable for model sizes of less than 10 6 states.…”
Section: B Related Workmentioning
confidence: 99%
“…Direct methods such as dynamic programming are intractable in practice [13]. We list the following approaches that can compute policies on small POSGs: using a memory bounded representation of the value function [11]; heuristically reformulating as a Bayesian game [26]; performing value iteration and applying beliefbased heuristics [16]; and leveraging marginal probabilities to compactly represent the belief spaces [18] . While these general approaches rely on fewer assumptions than the proposed, dedicated, method, they are only tractable for model sizes of less than 10 6 states.…”
Section: B Related Workmentioning
confidence: 99%
“…loss) functions, the player's strategies are computed by maximizing (resp. minimizing) the objective function [8], [11], [13]. Carrol and Grosu [5] used a signaling game to study honeypot deception.…”
Section: Introductionmentioning
confidence: 99%
“…Active deception [Jajodia et al, 2016] employs decoy systems and other defenses, including access control and online network reconfiguration, to conduct deceptive planning against the intrusion of malicious users who have been detected and confirmed by sensing systems. To design defense strategies with deception, game theory has been employed [Hor, 2012, Al-Shaer et al, 2019, Horak et al, 2019, Cohen, 2006, Zhu and Rass, 2018. These game-theoretic models express the attacker and defender's objectives using reward/loss functions.…”
Section: Introductionmentioning
confidence: 99%
“…These game-theoretic models express the attacker and defender's objectives using reward/loss functions. In [Horak et al, 2019], a partially observable stochastic game is formulated to capture the interaction between an attacker and a defender with one-sided partial observations. The attacker is to exploit and compromise the system without being detected and has complete observation.…”
Section: Introductionmentioning
confidence: 99%