2019
DOI: 10.1137/18m1203900
|View full text |Cite
|
Sign up to set email alerts
|

An Efficient DP Algorithm on a Tree-Structure for Finite Horizon Optimal Control Problems

Abstract: The classical Dynamic Programming (DP) approach to optimal control problems is based on the characterization of the value function as the unique viscosity solution of a Hamilton-Jacobi-Bellman (HJB) equation. The DP scheme for the numerical approximation of viscosity solutions of Bellman equations is typically based on a time discretization which is projected on a fixed state-space grid. The time discretization can be done by a one-step scheme for the dynamics and the projection on the grid typically uses a lo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

2
60
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
8
1

Relationship

2
7

Authors

Journals

citations
Cited by 48 publications
(62 citation statements)
references
References 22 publications
2
60
0
Order By: Relevance
“…The former dynamics may be observed almost completely, while the latter dynamics can be almost partly as considered in the presented model. Considering several state variables leads to an optimization problem having a huge degree of freedom whose numerical computation encounters the cure of dimensionality unless a sparse and efficient method is employed 97 . Considering both completely observable and partially observable dynamics would pose additional theoretical and technical issues.…”
Section: Discussionmentioning
confidence: 99%
“…The former dynamics may be observed almost completely, while the latter dynamics can be almost partly as considered in the presented model. Considering several state variables leads to an optimization problem having a huge degree of freedom whose numerical computation encounters the cure of dimensionality unless a sparse and efficient method is employed 97 . Considering both completely observable and partially observable dynamics would pose additional theoretical and technical issues.…”
Section: Discussionmentioning
confidence: 99%
“…In the literature there are different techniques to tackle this situation. Related recent works include, polynomial approximation [28], deep neural technique [32], tensor calculus [18], Taylor series expansions [13] and graph-tree structures [5].…”
Section: Introductionmentioning
confidence: 99%
“…However, the design of numerical methods for the solution of high-dimensional HJ PDEs remains a daunting task. Along this direction, some encouraging results have been obtained over the last years in connection with the use of sparse grids [11,29], causality-free methods [34,22,48], machine learning techniques [40,39], tensor calculus [47], graph-tree structures [3], Taylor series expansions [17], and polynomial approximation [32]. In this latter work, we develop a numerical scheme based on a high-dimensional polynomial ansatz for the value function coupled with a Newton-type (policy iteration) method for the solution of the Galerkin residual equation associated to the HJB equation.…”
Section: Introductionmentioning
confidence: 99%