2015
DOI: 10.1007/978-3-662-48995-6_30
|View full text |Cite
|
Sign up to set email alerts
|

Inverse Game Theory: Learning Utilities in Succinct Games

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
20
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 24 publications
(20 citation statements)
references
References 23 publications
0
20
0
Order By: Relevance
“…Under the condition (I −P iP + i )p i = 0 and by noting thatP i is positive semidefinite due to P i (0) being positive semidefinite, [33,Proposition 15.2] gives that this quadratic program is solved by anyα i satisfying (16) for any b ∈ R Mi+n−1−rP i . The first theorem assertion (15) and (16) follows. Now, from (16), clearly,…”
Section: B Solution Of Soft-constrained Methodsmentioning
confidence: 83%
“…Under the condition (I −P iP + i )p i = 0 and by noting thatP i is positive semidefinite due to P i (0) being positive semidefinite, [33,Proposition 15.2] gives that this quadratic program is solved by anyα i satisfying (16) for any b ∈ R Mi+n−1−rP i . The first theorem assertion (15) and (16) follows. Now, from (16), clearly,…”
Section: B Solution Of Soft-constrained Methodsmentioning
confidence: 83%
“…Approx. IESAR (ours) Markov Nash Yes MA-AIRL (Yu et al, 2019) Markov LSBRE Yes (Lin et al, 2019) Markov Various No (Natarajan et al, 2010) Markov Cooperative Game No (Waugh et al, 2011) Normal-Form Correlated Yes (Bestick et al, 2013) Normal-Form Correlated Yes (Kuleshov & Schrijvers, 2015) Normal-Form Correlated No Table 1: A comparison of our method against prior works in multi-agent inverse reinforcement learning and inverse game theory. The final column, labeled "Func.…”
Section: Algorithmmentioning
confidence: 99%
“…Inverse game theory. Within the context of game-theoretic and multi-agent settings, inverse game theory (Kuleshov & Schrijvers, 2015) and the inverse equilibrium problem (Waugh et al, 2011;Bestick et al, 2013) present general formulations for inferring utilities in normal form games using linear programming. In particular, these methods assume the agents rationalize to a correlated equilibrium, and Waugh et al (2011) further adopts the MaxEnt principle to disambiguate among correlated equilibria that rationalizes observed behavior.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, Honorio and Ortiz (2015) adopt a specific generative model, and optimize the fit of key parameters to the available data. Other works also employ diverse optimization techniques to uncover the payoff matrix under a best-response constraint (Kuleshov and Schrijvers 2015;Waugh, Ziebart, and Bagnell 2011;Ling, Fang, and Kolter 2018;. Models learned with such techniques have been shown to fit well to some realworld scenarios (Garg and Jaakkola 2016;.…”
Section: Multiagent Simulation For Game Model Learningmentioning
confidence: 99%