2017 IEEE International Conference on Robotics and Automation (ICRA) 2017
DOI: 10.1109/icra.2017.7989202
|View full text |Cite
|
Sign up to set email alerts
|

Information theoretic MPC for model-based reinforcement learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
395
0
2

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 400 publications
(398 citation statements)
references
References 16 publications
1
395
0
2
Order By: Relevance
“…This platform is approximately 1 meter long, weighs over 20 kilograms, and has a top speed over 20 m/s. Previous works have demonstrated that the MPPI controller (with tuned soft cost terms) is capable of navigating this type of vehicle around a simple elliptical track [25,26], which we did our best to match in our simulation experiments. Our real-world experiments use the same type of vehicle as these prior works, but in a more challenging environment (Fig.…”
Section: /5 Scale Autonomous Racing Experimentsmentioning
confidence: 80%
See 2 more Smart Citations
“…This platform is approximately 1 meter long, weighs over 20 kilograms, and has a top speed over 20 m/s. Previous works have demonstrated that the MPPI controller (with tuned soft cost terms) is capable of navigating this type of vehicle around a simple elliptical track [25,26], which we did our best to match in our simulation experiments. Our real-world experiments use the same type of vehicle as these prior works, but in a more challenging environment (Fig.…”
Section: /5 Scale Autonomous Racing Experimentsmentioning
confidence: 80%
“…There are then 3 components of the Tube-MPC algorithm that we need: (1) a nominal controller, (2) a method for setting the nominal state, and (3) an ancillary controller. We use an information theoretic interpretation of model predictive path integral control (MPPI) [26], so we will hereon refer to our method as Tube-MPPI.…”
Section: Robust Sampling Based Mpcmentioning
confidence: 99%
See 1 more Smart Citation
“…Model Predictive Control (MPC)-based optimal controllers (e.g. Model Predictive Path Integral (MPPI) [19]) provide planned control trajectories given an initial state and a cost function by solving the optimal control problem. An optimal control problem whose objective is to minimize a task-specific cost function J(X, U) can be formulated as follows:…”
Section: A Model Predictive Optimal Controlmentioning
confidence: 99%
“…This can be solved in a receding horizon fashion in an MPC framework and it allows us to have a real-time optimal controller with feedback. In our work, a sampling-based receding-horizon stochastic optimization algorithm, MPPI controller [19] is used as an MPC controller. We chose MPPI for several reasons, first off being the generality of cost functions and dynamics allowed.…”
Section: A Model Predictive Optimal Controlmentioning
confidence: 99%