2020
DOI: 10.1007/s10208-020-09460-1
|View full text |Cite
|
Sign up to set email alerts
|

A Neural Network-Based Policy Iteration Algorithm with Global $$H^2$$-Superlinear Convergence for Stochastic Games on Domains

Abstract: In this work, we propose a class of numerical schemes for solving semilinear Hamilton-Jacobi-Bellman-Isaacs (HJBI) boundary value problems which arise naturally from exit time problems of diffusion processes with controlled drift. We exploit policy iteration to reduce the semilinear problem into a sequence of linear Dirichlet problems, which are subsequently approximated by a multilayer feedforward neural network ansatz. We establish that the numerical solutions converge globally in the H 2norm and further dem… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
12
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 24 publications
(12 citation statements)
references
References 42 publications
0
12
0
Order By: Relevance
“…We then extend the residual based method for scalar PDEs in [44,31] to the coupled PDE system (4.3).…”
Section: Implementation Of the Fipde Methods Via Residual Approximationmentioning
confidence: 99%
“…We then extend the residual based method for scalar PDEs in [44,31] to the coupled PDE system (4.3).…”
Section: Implementation Of the Fipde Methods Via Residual Approximationmentioning
confidence: 99%
“…Recent years have seen progress, in particular in the context of option pricing for Black-Scholes-type models, for DNN-based numerical approximation of diffusion models on possibly large baskets (see e.g. Berner et al [9], Elbrächter et al [22] and Ito et al [34], Reisinger and Zhang [45] for game-type options). These references prove that DNN-based approximations of option prices on possibly large baskets of risky assets can overcome the so-called curse of dimensionality in the context of affine diffusion models for the dynamics of the (log-)prices of the underlying risky assets.…”
Section: Introductionmentioning
confidence: 99%
“…We refer, for instance, to [43,44] for approximation methods for semilinear parabolic PDEs based on standard Monte Carlo approximations for nested conditional expectations. We refer, for instance, to [5,12,13,16,17,21,33,34,39,41] and the references therein for deep learning-based approximation methods for high-dimensional PDEs. We refer, for instance, to [14,15,29] for full-history recursive multilevel Picard approximation methods for semilinear parabolic PDEs (in the following we abbreviate full-history recursive multilevel Picard by MLP).…”
Section: Introductionmentioning
confidence: 99%