Approximating optimal feedback controllers of finite horizon control problems using hierarchical tensor formats

Oster, Mathias; Sallandt, Leon; Schneider, Reinhold

doi:10.48550/arxiv.2104.06108

Cited by 2 publications

(10 citation statements)

References 26 publications


“…For recent applications of TTs as value function approximators, see e.g. [KKD19;OSS21a]. Solution methods based on high-dimensional polynomials and tensor spaces have also been considered in [DKK21;KK18].…”

Section: Related Work | mentioning

confidence: 99%

“…This approach is based on Bellman's principle. However, in contrast to comparable recent work [OSS21a], we use the HJB equation (4) on each subinterval instead of the Bellman equation. In particular, we define suitable approximate solutions to the HJB equation by means of the Dirac-Frenkel variational principle.…”

Section: Theorem 3 ([Bc97] | mentioning

confidence: 99%

“…TT approximations of the value function by means of such a backwards scheme were already presented e.g. in [OSS21a]. In that work however, the integral formulation (6) is used exclusively, sampling trajectories x(t) for given controls and adding up the costs.…”

Section: Assume Now That | mentioning

confidence: 99%

“…As a benchmark for assessing the performance of our method, we use the TT-based approach from [OSS21a] with the same hyper-parameters. To make this precise, instead of solving (24) by means of our dynamical low-rank scheme, $V_i$ is approximated in each policy iteration step by sampling the trajectories $x_k(t)$, $t \in [t_i, t_{i+1}]$ of all sample points.…”

Section: Numerical Tests | mentioning

confidence: 99%

“…Tree based tensor networks and tensor trains in particular have already been used for successful approximations of the value function in various works, see e.g. [Fac+20;KKD19;OSS21a]. These recent results are summarized in the PhD thesis of Leon Sallandt [Sal21], which is still being finalised as this paper is written.…”

Section: Introduction | mentioning

confidence: 99%


Dynamical low-rank approximations of solutions to the Hamilton-Jacobi-Bellman equation

Eigel, Schneider, Sommer

2021, Preprint

Self Cite


We present a novel method to approximate optimal feedback laws for nonlinear optimal control based on low-rank tensor train (TT) decompositions. The approach is based on the Dirac-Frenkel variational principle with the modification that the optimisation uses an empirical risk. Compared to current state-of-the-art TT methods, our approach exhibits a greatly reduced computational burden while achieving comparable results. A rigorous description of the numerical scheme and demonstrations of its performance are provided.


Dynamical low‐rank approximations of solutions to the Hamilton–Jacobi–Bellman equation

Eigel, Schneider, Sommer

2022, Numerical Linear Algebra App


We present a novel method to approximate optimal feedback laws for nonlinear optimal control based on low‐rank tensor train (TT) decompositions. The approach is based on the Dirac–Frenkel variational principle with the modification that the optimization uses an empirical risk. Compared to current state‐of‐the‐art TT methods, our approach exhibits a greatly reduced computational burden while achieving comparable results. A rigorous description of the numerical scheme and demonstrations of its performance are provided.


Approximating optimal feedback controllers of finite horizon control problems using hierarchical tensor formats

Abstract: Controlling systems of ordinary differential equations (ODEs) is ubiquitous in science and engineering. For finding an optimal feedback controller, the value function and associated fundamental equations such as the Bellman equation and the Hamilton-Jacobi-Bellman (HJB) equa-
