2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00777
|View full text |Cite
|
Sign up to set email alerts
|

CDPN: Coordinates-Based Disentangled Pose Network for Real-Time RGB-Based 6-DoF Object Pose Estimation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
310
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 393 publications
(310 citation statements)
references
References 17 publications
0
310
0
Order By: Relevance
“…Methods establishing the correspondences in the opposite direction, i.e. by predicting the 3D object coordinates [4] for a densely sampled set of pixels, have been also proposed [32,46,69,48,39]. As discussed below, none of the existing correspondence-based methods can reliably handle pose ambiguity due to object symmetries.…”
Section: Related Workmentioning
confidence: 99%
“…Methods establishing the correspondences in the opposite direction, i.e. by predicting the 3D object coordinates [4] for a densely sampled set of pixels, have been also proposed [32,46,69,48,39]. As discussed below, none of the existing correspondence-based methods can reliably handle pose ambiguity due to object symmetries.…”
Section: Related Workmentioning
confidence: 99%
“…CDPN (Li et al, 2019) uses a detector as a first stage to detect the object in the image. On the second stage the proposed Coordinates-based Disentangled Pose Network splits the computation into two paths: The first regresses the object translation, the second regresses 3D coordinates for all object pixels and uses PnP to compute the object rotation.…”
Section: Related Workmentioning
confidence: 99%
“…Given a known grasp configuration for an object in its local coordinate system, the task of grasping is simplified to estimating the pose of the object such that the grasp pose is transformed into the new scene. Traditional methods identify hand-crafted features to localize an object model within a scene (Klank et al, 2009 ; Srinivasa et al, 2010 ; Chitta et al, 2012a ) but more recently advances for pose estimation have been made by the application of deep learning (Xiang et al, 2018 ; Li et al, 2019 ; Park et al, 2019b ; Zakharov et al, 2019 ) and grasping pipelines achieve high success rate (Tremblay et al, 2018 ; Wang C. et al, 2019 ). The main limitation of this direction of research, however, is the closed-world assumption.…”
Section: Related Workmentioning
confidence: 99%
“…In order to make this extension, we employ the normalized object coordinate space that has been used to estimate the 6D pose of instances (Li et al, 2019 ; Park et al, 2019b ) and classes (Wang H. et al, 2019 ). Since NOC values represent coordinate values in the object's local frame and correspondences between the object model and the scene, predicting NOC values is sufficient for computing the transformation between local points from one observation to another.…”
Section: Related Workmentioning
confidence: 99%