2020
DOI: 10.48550/arxiv.2009.12678
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

I Like to Move It: 6D Pose Estimation as an Action Decision Process

Benjamin Busam,
Hyun Jun Jung,
Nassir Navab

Abstract: Object pose estimation is an integral part of robot vision and augmented reality. Robust and accurate pose prediction of both object rotation and translation is a crucial element to enable precise and safe human-machine interactions and to allow visualization in mixed reality. Previous 6D pose estimation methods treat the problem either as a regression task or discretize the pose space to classify. We reformulate the problem as an action decision process where an initial pose is updated in incremental discrete… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5

Relationship

3
2

Authors

Journals

citations
Cited by 8 publications
(10 citation statements)
references
References 72 publications
0
10
0
Order By: Relevance
“…The superior performance of SE(3)-TrackerNet mostly comes from the rotation-translation disentanglement scheme, a Lie Algebra rotation representation and the domain randomization data enhancement strategy [69]. Beyond the above methods, Busam et al [151] propose "I Like to Move It", which learns a decision process to iteratively determine an acceptable final pose. They build a decision process where an initial pose is updated in incremental discrete steps.…”
Section: Instance Level Monocular Object Pose Trackingmentioning
confidence: 99%
“…The superior performance of SE(3)-TrackerNet mostly comes from the rotation-translation disentanglement scheme, a Lie Algebra rotation representation and the domain randomization data enhancement strategy [69]. Beyond the above methods, Busam et al [151] propose "I Like to Move It", which learns a decision process to iteratively determine an acceptable final pose. They build a decision process where an initial pose is updated in incremental discrete steps.…”
Section: Instance Level Monocular Object Pose Trackingmentioning
confidence: 99%
“…While the accuracy of estimating the 6D pose keeps steadily increasing, it also comes with a heavy burden in annotating data for 3D CAD model and 6D pose, which scales poorly to multiple objects, often rendering it impractical for real applications [37], [38], [39]. In contrast, our method only needs a short live demonstration of an human-object interaction in order to reliably interact with novel objects.…”
Section: A Model-based Graspingmentioning
confidence: 99%
“…Reinforcement Learning in Object Pose Estimation: In the related domain of object pose estimation, Krull et al [11] use RL to find a policy that efficiently allocates refinement iterations to a pool of pose hypotheses. Closely related to our approach, in RGB-based object pose estimation [24,5], RL is used to train policies that manipulate an object's pose. Based on 2D segmentation masks, these agents learn to predict discrete refinement actions.…”
Section: Related Workmentioning
confidence: 99%
“…In an effort to bridge this gap between methods, we design a novel registration approach that unifies accuracy, robustness to noise and initialization with inference speed. While reinforcement learning methods for RGB-based object pose refinement are proposed [24,5], to the best of our knowledge, we are the first to consider 3D point cloud registration as a reinforcement learning problem. Our approach is based on a combination of Imitation Learning (IL) and Reinforcement Learning (RL); imitating an expert to learn an accurate initial policy, reinforcing a symmetry-invariant reward to further improve the policy.…”
Section: Introductionmentioning
confidence: 99%