2021 IEEE International Conference on Robotics and Automation (ICRA) 2021
DOI: 10.1109/icra48506.2021.9561675
|View full text |Cite
|
Sign up to set email alerts
|

Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
61
0
1

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 172 publications
(62 citation statements)
references
References 29 publications
0
61
0
1
Order By: Relevance
“…First, our method heavily relies on semantic segmentation method, bad segmentation will lead to awful geometry and interlaced texture. We plan to address it by incorporating the efficient RGBD segmentation [40]. Besides, since depth sensors lack observations from specific materials such as shaggy hair and yarn clothes, our method cannot get good geometry to these areas.…”
Section: Limitationmentioning
confidence: 99%
“…First, our method heavily relies on semantic segmentation method, bad segmentation will lead to awful geometry and interlaced texture. We plan to address it by incorporating the efficient RGBD segmentation [40]. Besides, since depth sensors lack observations from specific materials such as shaggy hair and yarn clothes, our method cannot get good geometry to these areas.…”
Section: Limitationmentioning
confidence: 99%
“…Unlike other approaches [2, 20 -23], we do not apply any post-processing to the resulting maps, such as CRFs, as this would notably increase runtime. For semantic segmentation, we rely on our recently proposed RGB-D approach for efficient indoor scene analysis, called ESANet [8]. Semantic mapping can be performed using either semantic Bayesian kernel inference mapping (S-BKI), the proposed optimized version of this approach (OS-BKI), or the proposed semantic…”
Section: Efficient and Robust Semantic Mappingmentioning
confidence: 99%
“…Incorporating depth images can alleviate this effect by providing complementary geometric information. In [8], we have shown that RGB-D segmentation is superior to processing RGB images solely and that it can still be performed in real time if the network architecture is carefully designed. More precisely, we use ESANet-R34-NBt1D (enhanced ResNet34-based encoder utilizing the Non-Bottleneck-1D block (NBt1D) [27]) in our pipeline.…”
Section: A Semantic Segmentationmentioning
confidence: 99%
See 2 more Smart Citations