2018 IEEE International Conference on Robotics and Automation (ICRA) 2018
DOI: 10.1109/icra.2018.8461184
|View full text |Cite
|
Sign up to set email alerts
|

SE3-Pose-Nets: Structured Deep Dynamics Models for Visuomotor Control

Abstract: In this work, we present an approach to deep visuomotor control using structured deep dynamics models. Our deep dynamics model, a variant of SE3-Nets, learns a low-dimensional pose embedding for visuomotor control via an encoder-decoder structure. Unlike prior work, our dynamics model is structured: given an input scene, our network explicitly learns to segment salient parts and predict their poseembedding along with their motion modeled as a change in the pose space due to the applied actions. We train our mo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
61
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
4
4
1

Relationship

2
7

Authors

Journals

citations
Cited by 56 publications
(61 citation statements)
references
References 22 publications
(35 reference statements)
0
61
0
Order By: Relevance
“…Different variants of deep learning have been shown to successfully learn predictive physics models and robot control policies in a purely data-driven way (Agrawal et al, 2016; Byravan et al, 2018; Jonschkowski et al, 2017; Watter et al, 2015). Although such a learning-based paradigm could potentially inherit the robustness of intuitive physics reasoning, current approaches are nowhere near human prediction and control capabilities.…”
Section: Deep Learning and Physics-based Modelsmentioning
confidence: 99%
“…Different variants of deep learning have been shown to successfully learn predictive physics models and robot control policies in a purely data-driven way (Agrawal et al, 2016; Byravan et al, 2018; Jonschkowski et al, 2017; Watter et al, 2015). Although such a learning-based paradigm could potentially inherit the robustness of intuitive physics reasoning, current approaches are nowhere near human prediction and control capabilities.…”
Section: Deep Learning and Physics-based Modelsmentioning
confidence: 99%
“…Some studies use an autoencoder to learn a low-dimensional latent representation of input images and then perform specific tasks based on the latent representation. Byravan et al [2] design SE3-Pose-NETS, which learns a lowdimensional pose embedding for visuomotor control via an autoencoder structure. The low-dimensional pose is used as the input to a three-layer network that is used to predict the 6D pose.…”
Section: Related Workmentioning
confidence: 99%
“…Our work is mostly inspired by previous works on learning control and dynamics in the latent space [9], [10]. Both of these works learn a latent space representation of the state, and also learn a dynamics model in the latent space.…”
Section: Related Workmentioning
confidence: 99%