2020
DOI: 10.1016/j.sysconle.2020.104744
|View full text |Cite
|
Sign up to set email alerts
|

Value iteration algorithm for mean-field games

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(10 citation statements)
references
References 23 publications
0
10
0
Order By: Relevance
“…Recall that sup x∈X g(x) ≤ diam(X) and sup x∈X g 2 (x) ≤ diam(X) 2 . Therefore, we can bound the last term as follows…”
Section: Recall the Optimality Equation For Mdpmentioning
confidence: 99%
See 1 more Smart Citation
“…Recall that sup x∈X g(x) ≤ diam(X) and sup x∈X g 2 (x) ≤ diam(X) 2 . Therefore, we can bound the last term as follows…”
Section: Recall the Optimality Equation For Mdpmentioning
confidence: 99%
“…The previous studies reviewed only establish the existence and uniqueness of the meanfield equilibrium, but do not provide an algorithm with a guarantee of convergence to compute it, with the exception of the model with linear state dynamics and two other papers [41,2]. This work investigates this problem for linear mean-field games and proposes an algorithm that formulates the game as a GNEP, proving the convergence of the algorithm to the stationary mean-field equilibrium.…”
mentioning
confidence: 99%
“…For linear-quadratic mean field games, Li and Zhang [30] employed the fixed point method to study mean field games with time-averaged stochastic cost functionals and introduced the notion of decentralized asymptotic Nash equilibrium in the sense of probability; Bardi [2] discussed the case with time-averaged deterministic cost functionals by the direct method developed in [26][27][28]; Huang and Zhou [23] studied the asymptotic solvability of linear-quadratic mean field games by establishing a scale reset method; Bensoussan et al [3] considered linear-quadratic mean field games based on the stochastic maximum principle. For nonlinear mean field games, Anahtarci et al [1] developed value iteration algorithms to investigate mean field games with discounted cost functionals and time-averaged cost functionals respectively, then they proved that the sequence of strategies designed is a decentralized asymptotic Nash equilibrium.…”
Section: Introductionmentioning
confidence: 99%
“…We use the subscript L and subscript F as the label of major and minor players. For a vector or matrix X, X ⊤ denotes the transposition of X, ∥X∥ denotes the 2norm of vector or Frobenius norm of matrix, and Tr(X) denotes the trace of X; I n denotes the n×n-dimensional identity matrix, R n denotes the set of n-dimensional real column vectors, and R n×m denotes the set of n × mdimensional real matrices; C 1,2…”
Section: Introductionmentioning
confidence: 99%
“…As a cornerstone for applications, the development of numerical methods for these mean-field problems has also attracted a growing interest. Assuming full knowledge of the model, methods for which convergence guarantees have been established include finite difference schemes for partial differential equations [1,2], semi-Lagrangian schemes [22], augmented Lagrangian or primal-dual methods [5,17,18], value iteration algorithm [9], or neural network based stochastic methods [26,27]; see e.g. [4] for a recent overview.…”
mentioning
confidence: 99%