Nonlinear optimal control: approximations via moments and LMI-relaxations

Lasserre, Jean-Bernard; Prieur, Christophe; Henrion, Didier

doi:10.1109/cdc.2005.1582395

Cited by 19 publications

(19 citation statements)

References 16 publications

“…In the current paper, we propose some techniques to constructively derive a control law from the solution of the convex linear matrix inequality (LMI) relaxations of the OCP. So our contribution can be seen as an extension to synthesis of the performance analysis results of [4,5].…”

Section: Discussion (mentioning)

confidence: 99%

“…In this paper we consider the class of OCPs for which all problem data are polynomial. The approach we deploy (which was introduced in [4]) is based on moment theory and consists in deriving a hierarchy of convex linear matrix inequality (LMI) relaxations of the OCP which give an increasing sequence of lower bounds on the optimal value. These LMI problems can be solved using off-the-shelf semidefinite programming (SDP) solvers.…”

Section: Introduction (mentioning)

confidence: 99%

“…The contribution with respect to [4] and its extended version [5] is twofold. First, the derivation of the relaxation is obtained in a simpler way, starting from basic concepts.…”

Section: Introduction (mentioning)

confidence: 99%

Nonlinear optimal control synthesis via occupation measures

Henrion

Lasserre

Savorgnan

2008

2008 47th IEEE Conference on Decision and Control

We consider nonlinear optimal control problems (OCPs) for which all problem data are polynomial. In the first part of the paper, we review how occupation measures can be used to approximate pointwise the optimal value function of a given OCP, using a hierarchy of linear matrix inequality (LMI) relaxations. In the second part, we extend the methodology to approximate the optimal value function on a given set and we use such a function to constructively and computationally derive an almost optimal control law. Numerical examples show the effectiveness of the approach.
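The hierarchy mentioned in this abstract can be summarized by a short sketch of the standard occupation-measure formulation. The notation below ($h$, $H$, $f$, $x_0$, $T$, $\mu$, $\nu$, $v$, $p_r$) is assumed here for illustration and is not taken from the paper; state and input constraints are omitted for brevity.

```latex
% Polynomial OCP (all symbols are illustrative assumptions)
J^\star(x_0) = \inf_{u(\cdot)} \int_0^T h(x(t),u(t))\,dt + H(x(T))
\quad \text{s.t.} \quad \dot{x}(t) = f(x(t),u(t)), \quad x(0) = x_0 .

% Linear program over an occupation measure \mu and a terminal measure \nu
p^\star = \inf_{\mu \ge 0,\ \nu \ge 0} \int h \, d\mu + \int H \, d\nu
\quad \text{s.t.} \quad
\int v(T,x)\, d\nu(x) - v(0,x_0)
  = \int \Big( \frac{\partial v}{\partial t} + \nabla_x v \cdot f \Big)\, d\mu(t,x,u)
\quad \forall\, v \in \mathbb{R}[t,x] .

% Truncating to moments of degree at most 2r, with semidefinite constraints
% on moment and localizing matrices, gives an LMI relaxation with value p_r
p_r \le p_{r+1} \le p^\star \le J^\star(x_0) .
```

Every admissible trajectory induces a feasible pair of measures, which is why each relaxation value is a lower bound on the optimal cost.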

“…For example, consider the GPM arising when solving polynomial optimal control problems as detailed in [7]. We are seeking two occupation measures $d\mu_1(x, u)$ and $d\mu_2(x)$ of a state vector x(t) and input vector u(t) whose time variation are governed by the differential…”

Section: Several Measures (mentioning)

confidence: 99%

“…As explained in [7], a lower bound on the […] For the initial condition $x_0 = [1\ 1]$ the exact minimum time is equal to 3.5. In Table 1 …”

Section: Several Measures (mentioning)

confidence: 99%

GloptiPoly 3: moments, optimization and semidefinite programming

Henrion

Lasserre

Löfberg

2009

Optimization Methods and Software

524

476

Model‐based reinforcement learning for nonlinear optimal control with practical asymptotic stability guarantees

Kim

Lee

2020

AIChE Journal

We propose a new reinforcement learning approach for nonlinear optimal control where the value function is updated as restricted to control Lyapunov function (CLF) and the policy is improved using a variation of Sontag's formula. The practical asymptotic stability of the closed‐loop system is guaranteed during the training and at the end of training without requiring an additional actor network and its update rule. For a single‐layer neural network (NN) with exact basis functions, the approximate function converges to the optimal value function, resulting in the optimal controller. When a deep NN is used, the level set shapes of the trained NN become similar to those of the optimal value function. Because Sontag's formula with CLF is equivalent to the optimal controller when the given CLF has the same level set shapes as the optimal value function, Sontag's formula with the trained NN provides a nearly optimal controller.
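As a minimal sketch of the idea in this abstract, the code below implements the standard single-input Sontag formula (not the variation the authors propose) for a control-affine system ẋ = f(x) + g(x)u with a control Lyapunov function V. The function names `sontag_control` and `example_feedback`, and the scalar example system ẋ = x + u with V(x) = x²/2, are assumptions chosen for illustration.

```python
import math


def sontag_control(a: float, b: float) -> float:
    """Standard single-input Sontag formula.

    a = L_f V(x) and b = L_g V(x) are the Lie derivatives of the CLF V
    along the drift f and the input vector field g.
    """
    if b == 0.0:
        # Where L_g V vanishes the formula sets the input to zero.
        return 0.0
    return -(a + math.sqrt(a * a + b ** 4)) / b


def example_feedback(x: float) -> float:
    """Hypothetical example: x' = x + u with CLF V(x) = x**2 / 2."""
    a = x * x  # L_f V = V'(x) * f(x) = x * x
    b = x      # L_g V = V'(x) * g(x) = x * 1
    return sontag_control(a, b)


if __name__ == "__main__":
    for x in (-1.0, 0.0, 2.0):
        print(x, example_feedback(x))
```

For this example the feedback reduces to u = -(1 + √2)x, which is also the optimal linear-quadratic feedback for the cost ∫(x² + u²)dt. The chosen V has the same level sets as that problem's quadratic value function, so the example is consistent with the equivalence the abstract describes.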
