2019
DOI: 10.1609/aaai.v33i01.33012029
|View full text |Cite
|
Sign up to set email alerts
|

Solving Partially Observable Stochastic Games with Public Observations

Abstract: In many real-world problems, there is a dynamic interaction between competitive agents. Partially observable stochastic games (POSGs) are among the most general formal models that capture such dynamic scenarios. The model captures stochastic events, partial information of players about the environment, and the scenario does not have a fixed horizon. Solving POSGs in the most general setting is intractable.Therefore, the research has been focused on subclasses of POSGs that have a value of the game and admit de… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
31
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 32 publications
(31 citation statements)
references
References 11 publications
0
31
0
Order By: Relevance
“…c) POSGs: As mentioned in Sec. II, HSVI has been first directly applied to particular zero-sum (two-player) partially observable stochastic games (zs-POSGs): with one-sided partial observability [11] and with public observability [12]. These problems allow (i) not having to deal with nested beliefs, and (ii) reasoning about beliefs such that the optimal value function is convex/concave in belief-space, which allows defining generalizing value function approximators (upperand lower-bounds) as in POMDPs.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…c) POSGs: As mentioned in Sec. II, HSVI has been first directly applied to particular zero-sum (two-player) partially observable stochastic games (zs-POSGs): with one-sided partial observability [11] and with public observability [12]. These problems allow (i) not having to deal with nested beliefs, and (ii) reasoning about beliefs such that the optimal value function is convex/concave in belief-space, which allows defining generalizing value function approximators (upperand lower-bounds) as in POMDPs.…”
Section: Discussionmentioning
confidence: 99%
“…Recent works addressed particular cases of discounted (twoplayer) zero-sum partially observable stochastic games (zs-POSGs) using heuristic search: Horák et al [11] considered One-Sided POSGs, i.e., the case where one player has access to the system state, plus the action and observation of the other player, and Horák and Bošanský [12] considered POSGs with public observations, i.e., the case where each Player i knows his own private state s i and both players receive the same public observations of each player's private state, so that they have common knowledge of Player −i's belief over Player i's private state. Moving from MDPs and POMDPs (as in Smith's work 2007) to these settings requires changes to the algorithm that make a different approach necessary to theoretically analyze the finite-time convergence.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…When both players have partial information about the environment, the players may need to reason not only about their belief over possible states, but also about the belief the opponent has over the possible states, beliefs over beliefs, and so on. Restricting to subclasses of POSGs where this issue of nested beliefs does not arise allows us to design and implement algorithms that are guaranteed to converge to optimal strategies [Horák et al, 2017;Horák and Bošanský, 2019]. However, the scalability of the current algorithms even for this case is limited.…”
Section: Introductionmentioning
confidence: 99%