2018
DOI: 10.1016/j.sysconle.2018.06.005
|View full text |Cite
|
Sign up to set email alerts
|

Stochastic L1-optimal control via forward and backward sampling

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
36
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 30 publications
(37 citation statements)
references
References 13 publications
0
36
0
Order By: Relevance
“…It is easy to verify that the PDE associated with the new system is the same as the original one (12). For the full derivation of change of measure for FBSDEs, we refer readers to proof of Theorem 1 in [14].…”
Section: Importance Samplingmentioning
confidence: 99%
“…It is easy to verify that the PDE associated with the new system is the same as the original one (12). For the full derivation of change of measure for FBSDEs, we refer readers to proof of Theorem 1 in [14].…”
Section: Importance Samplingmentioning
confidence: 99%
“…Such control constraints are common in mechanical systems, where control forces and/or torques are bounded, and may be readily introduced in our framework via the addition of a "soft" constraint, integrated within the cost functional. In recent work, Exarchos et al [26] showed how box-type control constraints for L 1 -optimal control problems (also called minimum fuel problems), can be incorporated into an FBSDE scheme. These are in contrast to the more frequently used quadratic control cost (L 2 or minimum energy) SOC problems.…”
Section: Fbsde Reformulationmentioning
confidence: 99%
“…The initial valueỹ i 0 and its gradientz i 0 are parameterized by trainable variables φ and are randomly initialized. The optimal control action is calculated using the discretized version of (6) (or (26) for the control constrained case). The dynamicsx and value functionỹ are propagated using the Euler integration scheme, as shown in the algorithm.…”
Section: Deep Fbsde Controllermentioning
confidence: 99%
See 1 more Smart Citation
“…Usual difficulties involve the solution of the Hamilton-Jacobi-Bellman (HJB) equations associated to the optimal control problem: in [5], the stochastic HJB equation is iteratively solved with successive approximations; in [6], the infinitetime HJB equation is reformulated as an eigenvalue problem; in [7], a transformation approach is proposed for solving the HJB equation arising in quadratic-cost control for nonlinear deterministic and stochastic systems. Finally, in a pair of recent papers, a solution to the nonlinear HJB equation is provided, by expressing it in the form of decoupled Forward and Backward Stochastic Differential Equations (FBSDEs), for an L 2 -and an L 1 -type optimal control setting (see [8,9], respectively). As stated above, the solutions proposed in these references rely on a complete knowledge of the state of the system; thus, they do not require any nonlinear stateestimation algorithm to infer information from noisy measurements.…”
Section: Introductionmentioning
confidence: 99%