2018 IEEE International Conference on Robotics and Automation (ICRA) 2018
DOI: 10.1109/icra.2018.8461251
|View full text |Cite
|
Sign up to set email alerts
|

UnDeepVO: Monocular Visual Odometry Through Unsupervised Deep Learning

Abstract: We propose a novel monocular visual odometry (VO) system called UnDeepVO in this paper. UnDeepVO is able to estimate the 6-DoF pose of a monocular camera and the depth of its view by using deep neural networks. There are two salient features of the proposed UnDeepVO: one is the unsupervised deep learning scheme, and the other is the absolute scale recovery. Specifically, we train UnDeepVO by using stereo image pairs to recover the scale but test it by using consecutive monocular images. Thus, UnDeepVO is a mon… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
373
0
5

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
3
3

Relationship

1
8

Authors

Journals

citations
Cited by 476 publications
(379 citation statements)
references
References 18 publications
1
373
0
5
Order By: Relevance
“…The network architecture consists of CNN and LSTM, and the LSTM was proved to help reduce noises by incorporating the temporal information from past sequences. Li et al [14] presented UnDeepVO to estimate the 6-DOF poses of a monocular camera and the depth of its view. Stereo image pairs were harnessed for recovering absolute scale and loss function was defined on spatial and temporal dense information.…”
Section: A Deep Learning For Visual Odometrymentioning
confidence: 99%
“…The network architecture consists of CNN and LSTM, and the LSTM was proved to help reduce noises by incorporating the temporal information from past sequences. Li et al [14] presented UnDeepVO to estimate the 6-DOF poses of a monocular camera and the depth of its view. Stereo image pairs were harnessed for recovering absolute scale and loss function was defined on spatial and temporal dense information.…”
Section: A Deep Learning For Visual Odometrymentioning
confidence: 99%
“…[2] utilizes synthetic data and style transfer [55] to collect more diverse training data. [26] uses stereo image pairs to recover the scale.…”
Section: Related Workmentioning
confidence: 99%
“…In contrast to the previous approaches that utilize the ground truth depth as supervision, some works [6], [19] utilize the task of view synthesis as supervision for single-view depth estimation. This view synthesis approach was extended to unsupervised end-to-end learning of both depth and egomotion estimation [12], [20], [21], [22]. These methods simultaneously train a depth and a pose network on temporally contiguous frames with a photometric loss between the target and a synthesized nearby view.…”
Section: A Depth From a Single Imagementioning
confidence: 99%