2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC) 2017
DOI: 10.1109/itsc.2017.8317942
|View full text |Cite
|
Sign up to set email alerts
|

Fast semi-dense 3D semantic mapping with monocular visual SLAM

Abstract: The bundle of geometry and appearance in computer vision has proven to be a promising solution for robots across a wide variety of applications. Stereo cameras and RGB-D sensors are widely used to realise fast 3D reconstruction and trajectory tracking in a dense way. However, they lack flexibility of seamless switch between different scaled environments, i.e., indoor and outdoor scenes. In addition, semantic information are still hard to acquire in a 3D mapping. We address this challenge by combining the state… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
54
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 52 publications
(54 citation statements)
references
References 19 publications
0
54
0
Order By: Relevance
“…There has been a great interest from the computer vision and robotics communities to exploit object-level information since from the perspective of many applications, it is beneficial to explore the awareness that object instances can provide for assistive computer vision [7,8,9], tracking/SLAM [10,11], or place categorization/scene recognition and life-long mapping [12,13].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…There has been a great interest from the computer vision and robotics communities to exploit object-level information since from the perspective of many applications, it is beneficial to explore the awareness that object instances can provide for assistive computer vision [7,8,9], tracking/SLAM [10,11], or place categorization/scene recognition and life-long mapping [12,13].…”
Section: Related Workmentioning
confidence: 99%
“…The segmented labels are then projected/registered into the 3D reconstructed point cloud. Similarly, Li and Belaroussi [10] provided a 3D semantic mapping system from monocular images. Their methodology is based on LSD-SLAM [32], which estimates a semi-dense 3D reconstruction of the scene and performs camera localization from monocular images.…”
Section: Slam and Augmented Semantic Representationsmentioning
confidence: 99%
“…Cheng et al [31] applied ORB-SLAM to get real-scale 3D visual maps and CRF-RNN algorithm for semantic segmentation. In [32], this challenge was solved by combining the stateof-the-art deep learning algorithms and semi-dense SLAM based on a monocular camera. 2D semantic information are transferred to 3D mapping via correspondence between connective Keyframes with spatial consistency.…”
Section: Related Workmentioning
confidence: 99%
“…Their main contribution was an efficient spatial regularizing Conditional Random Field (CRF), which smoothes semantic labels throughout the point cloud. Li and Belaroussi (2016) extended this approach to monocular video while using the semi-dense map of LSD-SLAM (Engel et al, 2014). Here, the DeepLab-CNN (Chen et al, 2018) was used instead of a random forest for segmentation.…”
Section: Figmentioning
confidence: 99%