2021 International Conference on Content-Based Multimedia Indexing (CBMI) 2021
DOI: 10.1109/cbmi50038.2021.9461875
|View full text |Cite
|
Sign up to set email alerts
|

Review on Indoor RGB-D Semantic Segmentation with Deep Convolutional Neural Networks

Abstract: Many research works focus on leveraging the complementary geometric information of indoor depth sensors in vision tasks performed by deep convolutional neural networks, notably semantic segmentation. These works deal with a specific vision task known as "RGB-D Indoor Semantic Segmentation". The challenges and resulting solutions of this task differ from its standard RGB counterpart. This results in a new active research topic. The objective of this paper is to introduce the field of Deep Convolutional Neural N… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 26 publications
0
6
0
Order By: Relevance
“…The recall rate is the ratio of one target box to all target boxes that the model predicts correctly. In general, the recall rate and accuracy rate are difficult to be at a high level, so the parameters are extracted to measure the network performance [15] .…”
Section: Evaluation Indicatorsmentioning
confidence: 99%
“…The recall rate is the ratio of one target box to all target boxes that the model predicts correctly. In general, the recall rate and accuracy rate are difficult to be at a high level, so the parameters are extracted to measure the network performance [15] .…”
Section: Evaluation Indicatorsmentioning
confidence: 99%
“…With the development of low-cost depth sensors, there are increasingly more RGB-D-based deep learning methods in computer vision. However, there is no established methodology to perfectly fuse these two modalities inside a CNN [ 23 ]. Some common fusion methods are as follows.…”
Section: Related Workmentioning
confidence: 99%
“…The taxonomy we adopt in this work is closer to the one proposed by Barchid et al [51]. The authors classify RGB-D methods according to how depth data is used in the network, into: depth as input, depth as operation and depth as prediction -Fig.…”
Section: Rgb-d Semantic Segmentationmentioning
confidence: 99%