2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2020
DOI: 10.1109/iros45743.2020.9341314
|View full text |Cite
|
Sign up to set email alerts
|

se(3)-TrackNet: Data-driven 6D Pose Tracking by Calibrating Image Residuals in Synthetic Domains

Abstract: Tracking the 6D pose of objects in video sequences is important for robot manipulation. This work presents se(3)-TrackNet, a data-driven optimization approach for longterm, 6D pose tracking. It aims to identify the optimal relative pose given the current RGB-D observation and a synthetic image conditioned on the previous best estimate and the object's model. The key contribution in this context is a novel neural network architecture, which appropriately disentangles the feature encoding to help reduce domain s… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
101
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 114 publications
(101 citation statements)
references
References 28 publications
0
101
0
Order By: Relevance
“…This work leverages recent advances in visual tracking that employ temporal cues to dynamically update the 6D pose of tracked objects. In particular, recent work in visual tracking [15] achieves robust and accurate enough estimates at a low latency to work with a wide range of objects. This allows easy integration of visual tracking with planning and control for closing the feedback loop.…”
Section: A Visual 6d Object Pose Tracking Based On Synthetic Trainingmentioning
confidence: 99%
See 3 more Smart Citations
“…This work leverages recent advances in visual tracking that employ temporal cues to dynamically update the 6D pose of tracked objects. In particular, recent work in visual tracking [15] achieves robust and accurate enough estimates at a low latency to work with a wide range of objects. This allows easy integration of visual tracking with planning and control for closing the feedback loop.…”
Section: A Visual 6d Object Pose Tracking Based On Synthetic Trainingmentioning
confidence: 99%
“…In order to generate the paired input image I t−1 for the network, the process randomly samples a Gaussian relative motion transformation T t t−1 ∈ SE(3) centered on the identity relative transform to render the prior frame I t−1 . During training, data augmentations involving random HSV shift, Gaussian noise, Gaussian blur, and depth-sensing corruption are applied to the RGB and depth data in frame I t , following bi-directional domain alignment techniques [15]. Training on synthetic data takes 250 epochs and is readily applicable to real world scenarios without fine-tuning.…”
Section: A Visual 6d Object Pose Tracking Based On Synthetic Trainingmentioning
confidence: 99%
See 2 more Smart Citations
“…PoseCNN [10] proposed a network that regresses the center of object and regresses the 3D center distance from the camera directly. TrackNet [35] and DeepIM [36] captured the discrepancy between the current and previous images and used a network to estimate the pose residuals. But they required an initial pose to be estimated at the start of the process in order to make iterative refinement.…”
Section: Related Workmentioning
confidence: 99%