2020 21st International Conference on Research and Education in Mechatronics (REM) 2020
DOI: 10.1109/rem49740.2020.9313886
|View full text |Cite
|
Sign up to set email alerts
|

Expandable YOLO: 3D Object Detection from RGB-D Images

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 20 publications
(12 citation statements)
references
References 9 publications
0
12
0
Order By: Relevance
“…Most deep learning stair detection methods [ 4 , 7 , 8 ] focus on extracting stair features in monocular vision through a CNN, and there is no deep learning method to make full use of the complementary relationship between the RGB map and the depth map for stair detection. Regarding the RGB-D fusion methods for deep learning, some methods fuse features in the input and output locations by simple summation and concatenation [ 14 , 15 , 16 , 17 , 18 ], and some methods design special modules to explore the implicit relationship between the two modalities [ 19 , 20 , 21 , 22 ]. This section briefly introduces some RGB-D-based stair detection methods and some RGB-D fusion methods for deep learning.…”
Section: Related Workmentioning
confidence: 99%
“…Most deep learning stair detection methods [ 4 , 7 , 8 ] focus on extracting stair features in monocular vision through a CNN, and there is no deep learning method to make full use of the complementary relationship between the RGB map and the depth map for stair detection. Regarding the RGB-D fusion methods for deep learning, some methods fuse features in the input and output locations by simple summation and concatenation [ 14 , 15 , 16 , 17 , 18 ], and some methods design special modules to explore the implicit relationship between the two modalities [ 19 , 20 , 21 , 22 ]. This section briefly introduces some RGB-D-based stair detection methods and some RGB-D fusion methods for deep learning.…”
Section: Related Workmentioning
confidence: 99%
“…This is in itself a vast field of research, fueled in the last years by the great interest of the big technological actors and the advent of deep learning. YOLO [ 81 ] represented a big leap for object detection in 2D images, and 3D versions have been proposed [ 82 , 83 , 84 ]. Other approaches based on 3D descriptors [ 85 , 86 , 87 ] or other deep learning architectures [ 88 , 89 , 90 ] have also been the subject of research.…”
Section: Applicationsmentioning
confidence: 99%
“…As a consequence, they use 2D CNN methods and do not fully exploit the 3D data geometry of the objects, although they achieve good recognition performances, especially in the case of occlusions [38]. An alternative method for object recognition by a depth camera is to include the depth channel along RGB channels (RGB-D) in combination with a 2D CNN [39] and recursive neural networks (RNNs) [40] or encode the depth channel in jet color maps and the surface of normals [41]. Consequently, RGB-D methods for 3D recognition do not fully exploit 3D geometric information, but they reduce the hardware requirements compared to model-based methods and thus enable real-time applications, which is the intended use for contactless PW control.…”
Section: Related Workmentioning
confidence: 99%