2013 IEEE International Conference on Robotics and Automation 2013
DOI: 10.1109/icra.2013.6630859
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal cue integration through Hypotheses Verification for RGB-D object recognition and 6DOF pose estimation

Abstract: This paper proposes an effective algorithm for recognizing objects and accurately estimating their 6DOF pose in scenes acquired by a RGB-D sensor. The proposed method is based on a combination of different recognition pipelines, each exploiting the data in a diverse manner and generating object hypotheses that are ultimately fused together in an Hypothesis Verification stage that globally enforces geometrical consistency between model hypotheses and the scene. Such a scheme boosts the overall recognition perfo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
55
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 84 publications
(56 citation statements)
references
References 14 publications
1
55
0
Order By: Relevance
“…To identify learnt objects within the environment, we use the object recognizer from [20]. To exploit the strengths of different feature descriptors, this recognizer extracts multiple features in parallel pipelines, generates object hypotheses by a graph-based grouping of merged feature correspondences, and verifies these object hypotheses by finding a global optimum solution that best explains the scene in terms of number of metrics.…”
Section: Object Recognitionmentioning
confidence: 99%
“…To identify learnt objects within the environment, we use the object recognizer from [20]. To exploit the strengths of different feature descriptors, this recognizer extracts multiple features in parallel pipelines, generates object hypotheses by a graph-based grouping of merged feature correspondences, and verifies these object hypotheses by finding a global optimum solution that best explains the scene in terms of number of metrics.…”
Section: Object Recognitionmentioning
confidence: 99%
“…Object recognition and pose estimation has received widespread attention from the computer vision and robotics communities. With the recent advances in RGB-D cameras, several systems have been developed to detect object types/instances and their 6-D poses from 3-D point clouds (Rusu et al, 2010;Glover et al, 2011;Lai et al, 2012;Aldoma et al, 2013;Marton et al, 2014). We will use one such detector (Glover and Popovic, 2013) as our black-box attribute detector, but we emphasize that our methods are agnostic to the detector used.…”
Section: Semantic World Modelingmentioning
confidence: 99%
“…In addition, an Iteractive Closest Point (ICP) algorithm for pose refinement introduced in [12] and extended in [48] was applied to point clouds. Finally, to reduce the number of false positives, a Hypotheses Verification (HV) step introduced in [3] was extended in [4] by additional hypotheses generation pipelines with the use of SIRF and OUR-CVFH descriptors. Aldoma et al adapted the solution from [4] to a multi-view recognition system [7].…”
Section: Related Workmentioning
confidence: 99%
“…Finally, to reduce the number of false positives, a Hypotheses Verification (HV) step introduced in [3] was extended in [4] by additional hypotheses generation pipelines with the use of SIRF and OUR-CVFH descriptors. Aldoma et al adapted the solution from [4] to a multi-view recognition system [7]. In [43] Tang et al demonstrated advantages of simultaneous segmentation, object detection and 3D pose recovery in extracted global and local feature models acquired from multi-view color images and point clouds.…”
Section: Related Workmentioning
confidence: 99%