2020
DOI: 10.1109/access.2020.3034386
|View full text |Cite
|
Sign up to set email alerts
|

Detecting 6D Poses of Target Objects From Cluttered Scenes by Learning to Align the Point Cloud Patches With the CAD Models

Abstract: 6D target object detection is of great importance to many applications such as robotics, industrial automation, and unmanned vehicles and is increasingly influencing broad industries including manufacturing, transportation, and retail industries, to name a few. Unlike the more common object detection methods that use the two-dimensional data such as RGB or depth images, the method proposed here relies on the three-dimensional point cloud of target objects to detect them in cluttered scenes in an end-to-end fas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 13 publications
(15 citation statements)
references
References 23 publications
0
15
0
Order By: Relevance
“…Since only point cloud is required to be processed, the network architecture proposed by Gao et al [77] is much more lightweight than methods that take both RGB data and depth data as input. Then, Chen et al [96] use segmented object point cloud patches from the target object's CAD model as input and form a pose prediction model without the need for fine-tuning, thus saving computational cost. Additionally, G2L-Net [78] decouples the prediction pipeline into global localization, translation localization, and rotation localization and estimates object pose through a coarse-to-fine manner without using a CAD model, achieving real-time performance.…”
Section: (Rgb)d-based Methodsmentioning
confidence: 99%
“…Since only point cloud is required to be processed, the network architecture proposed by Gao et al [77] is much more lightweight than methods that take both RGB data and depth data as input. Then, Chen et al [96] use segmented object point cloud patches from the target object's CAD model as input and form a pose prediction model without the need for fine-tuning, thus saving computational cost. Additionally, G2L-Net [78] decouples the prediction pipeline into global localization, translation localization, and rotation localization and estimates object pose through a coarse-to-fine manner without using a CAD model, achieving real-time performance.…”
Section: (Rgb)d-based Methodsmentioning
confidence: 99%
“…e top and bottom can be designed by the user or the design stored in the system. In either case, it should be a reasonable stacking of the top and bottom [11].…”
Section: Overall System Design and Functionmentioning
confidence: 99%
“…Latest research on object detection for robotics mainly focuses on improving precision in particularly difficult scenarios such as, occlusion and clutter [3], [4], [5], [6], [7]. To this aim, a widely used approach is to rely on deep learning based methods (like e.g.…”
Section: Related Workmentioning
confidence: 99%