2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022
DOI: 10.1109/iros47612.2022.9982268
|View full text |Cite
|
Sign up to set email alerts
|

MV6D: Multi-View 6D Pose Estimation on RGB-D Frames Using a Deep Point-wise Voting Network

Abstract: Estimating 6D poses of objects is an essential computer vision task. However, most conventional approaches rely on camera data from a single perspective and therefore suffer from occlusions. We overcome this issue with our novel multi-view 6D pose estimation method called MV6D which accurately predicts the 6D poses of all objects in a cluttered scene based on RGB-D images from multiple perspectives. We base our approach on the PVN3D network that uses a single RGB-D image to predict keypoints of the target obje… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 62 publications
0
3
0
Order By: Relevance
“…[16] improves categorylevel object pose and size prediction by modifying pose consistency learning. [17] uses a deep multi-directional fusion network to efficiently fuse RGB-D frames from multiple perspectives.…”
Section: B Rgb-d Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…[16] improves categorylevel object pose and size prediction by modifying pose consistency learning. [17] uses a deep multi-directional fusion network to efficiently fuse RGB-D frames from multiple perspectives.…”
Section: B Rgb-d Methodsmentioning
confidence: 99%
“…Currently, the academic community is increasingly focusing on deep learning based 6D pose estimation methods. However, most of these methods predominantly rely on RGB [3]- [8] or RGB-D [9], [11]- [17] information because they require the rich features provided by RGB information for subsequent predictions. While these methods might be effective in scenarios involving daily object grasping, they struggle with industrial object grasping becauseof the following reasons.…”
Section: Introductionmentioning
confidence: 99%
“…Recent studies on multi-view pose estimation have seen multiple approaches employing neural networks to tackle this task. MV6D predicts the 6D pose from multiple viewpoints and constitutes an end-to-end approach [ 26 ], eliminating the need for multiple prediction stages. Nevertheless, it relies on RGB-D data.…”
Section: Related Workmentioning
confidence: 99%