Robotics: Science and Systems XVII 2021
DOI: 10.15607/rss.2021.xvii.068
|View full text |Cite
|
Sign up to set email alerts
|

Dual Online Stein Variational Inference for Control and Dynamics

Abstract: Model predictive control (MPC) schemes have a proven track record for delivering aggressive and robust performance in many challenging control tasks, coping with nonlinear system dynamics, constraints, and observational noise. Despite their success, these methods often rely on simple control distributions, which can limit their performance in highly uncertain and complex environments. MPC frameworks must be able to accommodate changing distributions over system parameters, based on the most recent measurements… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 15 publications
(5 citation statements)
references
References 25 publications
0
5
0
Order By: Relevance
“…Continuing the direction of adaptive domain randomization, we are convinced that neural LFI powered by normalizing flows are auspicious approaches. The combination of highly flexible density estimators with widely applicable and sample-efficient inference methods allows one to identify multi-modal distributions over simulators with very mild assumptions ( Ramos et al, 2019 ; Barcelos et al, 2021 ; Muratore et al, 2021c ). By introducing an auxiliary optimality variable and making the policy parameters subject to the inference, we obtain the posterior over policies quantifying their likelihood of being optimal.…”
Section: Discussion and Outlookmentioning
confidence: 99%
“…Continuing the direction of adaptive domain randomization, we are convinced that neural LFI powered by normalizing flows are auspicious approaches. The combination of highly flexible density estimators with widely applicable and sample-efficient inference methods allows one to identify multi-modal distributions over simulators with very mild assumptions ( Ramos et al, 2019 ; Barcelos et al, 2021 ; Muratore et al, 2021c ). By introducing an auxiliary optimality variable and making the policy parameters subject to the inference, we obtain the posterior over policies quantifying their likelihood of being optimal.…”
Section: Discussion and Outlookmentioning
confidence: 99%
“…The same idea has been used before in MPPI Williams et al [2017], Wagener et al [2019]. Note that MPPI also uses multiple samples, but these samples all contribute to the update of a single action sequence, which can potentially limit the search , Barcelos et al [2021]. To allow reuse of old search but add diversity, we initialize one restart using the old actions shifted one time step, and initialize all other restarts randomly.…”
Section: Optimization Algorithmmentioning
confidence: 99%
“…Two widely used sampling based MPC techniques, MPPI [36] and CEM [15], use importance sampling to generate low-cost control sequences, and have strong connections to the inference formulation of SOC which was explored in [34]. Several recent papers have considered the SOC problem as Bayesian inference, and proposed methods of performing Variational Inference (VI) to approximate a posterior over low-cost control sequences [17,34,23,3]. These methods differ in how they represent the variational posterior.…”
Section: A Planning and Control As Inferencementioning
confidence: 99%
“…VI methods often use an independent Gaussian posterior, known as the mean-field approximation [4]. Okada and Taniguchi [23] represent the control sequence as a Gaussian mixture, and Lambert et al [17] use a particle representation, extended to handle parameter uncertainty in [3]. These representations allow for greater flexibility in representing complex posteriors.…”
Section: A Planning and Control As Inferencementioning
confidence: 99%
See 1 more Smart Citation