2020
DOI: 10.1007/978-3-030-58548-8_38
|View full text |Cite
|
Sign up to set email alerts
|

Neural Object Learning for 6D Pose Estimation Using a Few Cluttered Images

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(10 citation statements)
references
References 44 publications
0
10
0
Order By: Relevance
“…Labeled w/ GT 6D Pose is a tedious, costly and labor-intensive chore. Another category of works [25,34,41] makes use of the available object model to reduce the labeling effort. They render images with the CAD model and train the pose estimator on the synthetic images.…”
Section: Training Inferencementioning
confidence: 99%
“…Labeled w/ GT 6D Pose is a tedious, costly and labor-intensive chore. Another category of works [25,34,41] makes use of the available object model to reduce the labeling effort. They render images with the CAD model and train the pose estimator on the synthetic images.…”
Section: Training Inferencementioning
confidence: 99%
“…Each predicted hypothesis corresponded to a single 3D translation and rotation, and estimated hypotheses collapsed onto the same valid pose when the object appearance was unique. Other recent variations of this method include Trabelsi et al [45], who used a multi-task CNN-based encoder/multi-decoder network, and Wang et al [48] and [20,34,43], who used a rendering method by a self-supervised model on unannotated real RGB-D data to find a visually and geometrically optimal alignment.…”
Section: Related Workmentioning
confidence: 99%
“…There were five different network architectures proposed in the intial ResNet paper [13]. While some 6 DoF pose recovery works use variations of ResNet-18 [36,47,50,45] others use ResNet-50 [48,34]. Some customize the structure by converting it to encoder [45,34,50,47], adding extra layers and skip connections [36] while others use the original ResNet unaltered [14,33].…”
Section: D1 Number Of Skip Connectionsmentioning
confidence: 99%
“…Without depth information, these methods suffer dramatic performance degradation. NOL [28] generates novel views of a given object from a few cluttered real-world images in which object poses are needed. The generated views are self-supervised by the input images.…”
Section: Related Workmentioning
confidence: 99%