Robotics: Science and Systems XV 2019
DOI: 10.15607/rss.2019.xv.070
|View full text |Cite
|
Sign up to set email alerts
|

Learning Deep Stochastic Optimal Control Policies Using Forward-Backward SDEs

Abstract: In this paper we propose a new methodology for decision-making under uncertainty using recent advancements in the areas of nonlinear stochastic optimal control theory, applied mathematics, and machine learning. Grounded on the fundamental relation between certain nonlinear partial differential equations and forward-backward stochastic differential equations, we develop a control framework that is scalable and applicable to general classes of stochastic systems and decision-making problem formulations in roboti… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
23
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(23 citation statements)
references
References 22 publications
0
23
0
Order By: Relevance
“…Henceforth the stochastic control problem (12) subject to the dynamics (13), is referred to simply as problem (12). Put in words, the problem undertaken is to find a deterministic open loop optimal control for all s ∈ [t, T ] to minimize the cost (10) over [t, T ] given the PDE dynamics for p(s, x).…”
Section: Problem Statementmentioning
confidence: 99%
See 1 more Smart Citation
“…Henceforth the stochastic control problem (12) subject to the dynamics (13), is referred to simply as problem (12). Put in words, the problem undertaken is to find a deterministic open loop optimal control for all s ∈ [t, T ] to minimize the cost (10) over [t, T ] given the PDE dynamics for p(s, x).…”
Section: Problem Statementmentioning
confidence: 99%
“…The MFG case is further complicated due to the fully coupled nature of the HJB-FP system ( [7], [8], [9]). The first [10] and second ( [11], [12]) order forward-backward SDE (FBSDE) [1] framework has been applied to obtain algorithms for optimal control of dynamics with nonlinear drift and state multiplicative noise, but not in the case of control multiplicative Gaussian or the general case of non-Gaussian excitation [13].…”
Section: Introductionmentioning
confidence: 99%
“…The idea behind deep FBSDEs is to find a numerical approximation to the solution of the HJB equation, which is the value function given with respect to a cost function. Following the derivation in [15], the optimal control can then be calculated as a function of the partial derivative of the value function with respect to the state. We can formulate this problem as a FBSDE and solve it using a neural network which has the benefit of resolving compounding-errors.…”
Section: Introductionmentioning
confidence: 99%
“…Such a method has gained traction recently. [15] first proposed the deep FBSDE algorithm using LSTMs. On top of the vanilla LSTM-based FBSDE formulation, [16] showed how to handle systems with control multiplicative noise and [17] solved problems with unknown noise distribution.…”
Section: Introductionmentioning
confidence: 99%
“…The multiplicative noise severely affects stability and robustness [12] and the optimal control formulation in this case is non-standard due to the non-Gaussianity of the dynamics [9], and most works addressing either aspect in the linear-quadratic regime. In the case of nonlinear dynamics with multiplicative noise, differential dynamic programming [11] and forward backward stochastic differential equations using first [13] and second order schemes ( [14], [15]) have been used to synthesize algorithms to compute the control.…”
Section: Introductionmentioning
confidence: 99%