2019
DOI: 10.1109/access.2019.2926350
|View full text |Cite
|
Sign up to set email alerts
|

MagicVO: An End-to-End Hybrid CNN and Bi-LSTM Method for Monocular Visual Odometry

Abstract: For the robotic positioning and navigation, visual odometry (VO) system is widely used. However, the errors of the traditional VO accumulate when the robot moves. Besides, this paper proposes a new framework to solve the problem of monocular VO, called MagicVO. Based on the convolutional neural network (CNN) and the bi-directional LSTM (Bi-LSTM), MagicVO outputs a 6-DoF absolute-scale pose at each position of the camera with a sequence of continuous monocular images as input. It does not only utilize the outst… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
19
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 29 publications
(19 citation statements)
references
References 27 publications
0
19
0
Order By: Relevance
“…Monocular visual odometry is the task of tracking the flow of information from a single camera and predicting the path undertaken by it. Traditional methods [ 12 , 22 , 23 ] are solely insufficient and have to rely on external parameters to account for the ambiguities involved, such as scale, whereas deep learning approaches [ 24 , 25 ] are self-sufficient and show improved results compared to older traditional methods.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Monocular visual odometry is the task of tracking the flow of information from a single camera and predicting the path undertaken by it. Traditional methods [ 12 , 22 , 23 ] are solely insufficient and have to rely on external parameters to account for the ambiguities involved, such as scale, whereas deep learning approaches [ 24 , 25 ] are self-sufficient and show improved results compared to older traditional methods.…”
Section: Related Workmentioning
confidence: 99%
“…Jiao et al [ 25 ] proposed an iterative update to [ 24 ] using bi-directional LSTMs, and outperformed [ 24 ] on the KITTI dataset. Bi-directional LSTMs model relationships for both forward and backward sequences.…”
Section: Related Workmentioning
confidence: 99%
“…However, we use spherical cameras that have already been calibrated and design a distortion weight to address the distortion problem. Numerous self-(un)supervised visual odometry approaches have displayed the possibility of learning the network using unlabeled training data [13], [14], [18], [30]- [32]. These approaches attempted to provide supervision signals to train their networks using pseudo-labels generated from unlabeled training data.…”
Section: Related Workmentioning
confidence: 99%
“…This indicates the accuracy in test scenes cannot be guaranteed. In contrast, self-supervised learning approaches for deep learning tasks have recently attracted much attention [13]- [17]. These approaches do not require any explicitly labeled data for training, and thus present the possibility of optimizing the network using unlabeled data.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation