2019 International Conference on Robotics and Automation (ICRA) 2019
DOI: 10.1109/icra.2019.8794425
|View full text |Cite
|
Sign up to set email alerts
|

CNN-SVO: Improving the Mapping in Semi-Direct Visual Odometry Using Single-Image Depth Prediction

Abstract: Reliable feature correspondence between frames is a critical step in visual odometry (VO) and visual simultaneous localization and mapping (V-SLAM) algorithms. In comparison with existing VO and V-SLAM algorithms, semi-direct visual odometry (SVO) has two main advantages that lead to stateof-the-art frame rate camera motion estimation: direct pixel correspondence and efficient implementation of probabilistic mapping method. This paper improves the SVO mapping by initializing the mean and the variance of the de… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
69
0
3

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 85 publications
(72 citation statements)
references
References 17 publications
0
69
0
3
Order By: Relevance
“…In addition, we also compare the performance of the proposed method with that of three CNN-based methods: Yin's method [16], Sucar's method [17], Yang's method [18], and Loo's method [19]. ORB-SLAM using the proposed scale estimation method with and without the guidance of the motion vector are referred to as GCSEMK (geometrically constrained scale estimation from multiple keyframes) and SEMK (scale estimation from multiple keyframes), respectively.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…In addition, we also compare the performance of the proposed method with that of three CNN-based methods: Yin's method [16], Sucar's method [17], Yang's method [18], and Loo's method [19]. ORB-SLAM using the proposed scale estimation method with and without the guidance of the motion vector are referred to as GCSEMK (geometrically constrained scale estimation from multiple keyframes) and SEMK (scale estimation from multiple keyframes), respectively.…”
Section: Resultsmentioning
confidence: 99%
“…Comparing with the KITTI dataset, the RobotCar dataset is difficult for the vSLAM algorithms because of significant illumination changes. For some sequences in the Robot-Car dataset, the state-of-the-art vSLAM algorithms such as SVO [27], DSO [29], and ORB-SLAM cannot estimate camera pose due to feature tracking failure [19]. Since ORB-SLAM was used as our baseline algorithm, we tested the proposed method on the sequence for which the ORB-SLAM system can produce camera poses and 3D points.…”
Section: A Implementation Detailsmentioning
confidence: 99%
See 2 more Smart Citations
“…Recent and state-of-the-art mono vSLAM/VO methods mainly use CNNs to resolve the scale ambiguity problem. The approaches of [19,26,25] and [14] have trained a CNN to deploy a scaled depth map from single monocular images. These dense depth maps are used to extend the optimization scheme of the classical frameworks, like for instance a so-called virtual stereo setup [25] for DSO or initializing depth filters [14] in SVO directly from the depth map, to eliminate the scale ambiguity.…”
Section: Classical Methodsmentioning
confidence: 99%