2020
DOI: 10.48550/arxiv.2005.05409
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Solving high-dimensional Hamilton-Jacobi-Bellman PDEs using neural networks: perspectives from the theory of controlled diffusions and measures on path space

Abstract: Optimal control of diffusion processes is intimately connected to the problem of solving certain Hamilton-Jacobi-Bellman equations. Building on recent machine learning inspired approaches towards high-dimensional PDEs, we investigate the potential of iterative diffusion optimisation techniques, in particular considering applications in importance sampling and rare event simulation. The choice of an appropriate loss function being a central element in the algorithmic design, we develop a principled framework ba… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
9

Relationship

3
6

Authors

Journals

citations
Cited by 10 publications
(21 citation statements)
references
References 90 publications
0
21
0
Order By: Relevance
“…To the best of our knowledge, we are the first to establish a connection between IS and SOC in the context of pure jump processes, particularly for SRNs. Note that some existing works [26,30,28,27,36] have established a similar connection in the context of diffusion dynamics, mainly interested in efficiently estimating rare event probabilities using a path-dependent IS scheme.…”
Section: Introductionmentioning
confidence: 93%
“…To the best of our knowledge, we are the first to establish a connection between IS and SOC in the context of pure jump processes, particularly for SRNs. Note that some existing works [26,30,28,27,36] have established a similar connection in the context of diffusion dynamics, mainly interested in efficiently estimating rare event probabilities using a path-dependent IS scheme.…”
Section: Introductionmentioning
confidence: 93%
“…Finally, in the deterministic limit the method becomes local as there is no diffusion to enforce the trajectories to explore the whole space. Similar techniques are applied in [26]- [28] based on different loss functions.…”
Section: B High-dimensional Stochastic Optimal Controlmentioning
confidence: 99%
“…We now use these two ingredients to construct a connecting path between ρ 0 and ρ. By (56), there exists a T 1 < ∞ and a path ρ 1 : [0, T 1 ] → M connecting π to ρ 0 so that I [0,T1] (ρ 1 ) ≤ KL(ρ 0 )+1. By the time-reversal symmetry (57), the reversal ← − ρ 1 of this path connects ρ 0 to π, and satisfies I [0,T1] ( ← − ρ 1 ) ≤ 1.…”
Section: γ−Limmentioning
confidence: 99%