Open-Source, Object-Oriented, Multi-Phase Pseudospectral Optimization Using Pyomo

Schlossman, Rachel; Williams, Kyle A.; Kozlowski, David; Parish, Julie J.

doi:10.2514/6.2021-1951

Cited by 3 publications

(9 citation statements)

References 18 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In most cases the optimal control problem cannot be solved analytically. Rather, so-called direct transcription methods [1,42] or indirect methods [43] can be applied to solve the problem numerically, in which case the dynamics (1) can be approximated with a numerical discretization of the form…”

Section: A Optimal Controlmentioning

confidence: 99%

“…Trajectory generation is often formulated as an optimal control problem and then converted into a parameter optimization problem through direct transcription methods [1,42]. This produces a problem formulation that is compatible with modern nonlinear programming techniques [54].…”

Section: Problem Formulationmentioning

confidence: 99%

“…This produces a problem formulation that is compatible with modern nonlinear programming techniques [54]. For many problems this method works well [1,2], but long solution times and convergence stability can limit real-time application.…”

Section: Problem Formulationmentioning

confidence: 99%

“…The following shows that the best approximation error grows at a bounded linear rate along the horizon in the case where 𝑧 = 𝑦 𝑑𝑒𝑠 . (1) which is optimal with respect to (2). Define x(𝑡) = 𝑧 * (𝑡), where 𝑧 * (𝑡) is a piecewise polynomial as described in (9) over 𝑡 ∈ [𝑡 0 , 𝑡 𝐻 ] and best approximates 𝑓 (𝑥 * (𝑡), 𝑢 * (𝑡)) over 𝑡 ∈ [𝑡 0 , 𝑡 𝐻 ] in some sense.…”

Section: Accuracy Of Approximationmentioning

confidence: 99%

“…Many advances have been made in trajectory planning and optimization. Examples include nonlinear programming methods [1,2], sampling based methods [3][4][5],…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Trajectory Planning with Deep Reinforcement Learning in High-Level Action Spaces

Williams¹,

Schlossman²,

Whitten³

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

This paper presentsa technique for trajectory planning based on continuously parameterized high-level actions (motion primitives) of variable duration. This technique leverages deep reinforcement learning (Deep RL) to formulate a policy which is suitable for real-time implementation. There is no separation of motion primitive generation and trajectory planning: each individual short-horizon motion is formed during the Deep RL training to achieve the full-horizon objective. Effectiveness of the technique is demonstrated numerically on a well-studied trajectory generation problem and a planning problem on a known obstacle-rich map. This paper also develops a new loss function term for policy-gradient-based Deep RL, which is analogous to an anti-windup mechanism in feedback control. We demonstrate the inclusion of this new term in the underlying optimization increases the average policy return in our numerical example.

show abstract

Section: A Optimal Controlmentioning

confidence: 99%

Section: Problem Formulationmentioning

confidence: 99%

Section: Problem Formulationmentioning

confidence: 99%

Section: Accuracy Of Approximationmentioning

confidence: 99%

“…Many advances have been made in trajectory planning and optimization. Examples include nonlinear programming methods [1,2], sampling based methods [3][4][5],…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations