2017
DOI: 10.1177/0278364917734298
|View full text |Cite
|
Sign up to set email alerts
|

End-to-end, sequence-to-sequence probabilistic visual odometry through deep neural networks

Abstract: This paper studies visual odometry (VO) from the perspective of deep learning. After tremendous efforts in the robotics and computer vision communities over the past few decades, state-of-the-art VO algorithms have demonstrated incredible performance. However, since the VO problem is typically formulated as a pure geometric problem, one of the key features still missing from current VO systems is the capability to automatically gain knowledge and improve performance through learning. In this paper, we investig… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
192
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 210 publications
(192 citation statements)
references
References 71 publications
(117 reference statements)
0
192
0
Order By: Relevance
“…To fine-tune the network, we used the original image size in computing the losses. For data prepossessing, different kinds of data augmentation methods were used to enhance the performance and mitigate possible over-fitting, such as image color augmentation [5], rotational data augmentation [24] and left-right pose estimation augmentation [4]. We increased the weight parameter of rotational data to achieve better performance because the magnitude of rotation is very small compared to that of translation.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…To fine-tune the network, we used the original image size in computing the losses. For data prepossessing, different kinds of data augmentation methods were used to enhance the performance and mitigate possible over-fitting, such as image color augmentation [5], rotational data augmentation [24] and left-right pose estimation augmentation [4]. We increased the weight parameter of rotational data to achieve better performance because the magnitude of rotation is very small compared to that of translation.…”
Section: Methodsmentioning
confidence: 99%
“…The Discriminator training procedure is to make the W (x,x) convergent to the minimum. The above procedure is repeated [24], SfMlearner [5]; Stereo: UndeepVO [9], Undepthflow [11]), and one geometric based methods (ORB-SLAM). Our SGANVO, SfMLearner, and UndeepVO use images with 416×128.…”
Section: Adversarial Training Proceduresmentioning
confidence: 99%
See 1 more Smart Citation
“…The Refining module ameliorates previous outputs by employing a spatialtemporal feature reorganization mechanism. 22,[31][32][33]. Due to the high dimensionality of depth maps, the number of frames is commonly limited to no more than 5.…”
Section: Introductionmentioning
confidence: 99%
“…Furthermore, all these VIO systems require careful parameter tuning procedures for the specific environment they have to work in. 1 In recent years, deep learning based VO has drawn significant attentions due to its potentials in learning capability and the robustness to camera parameters and challenging environments [8], [9]. These data-driven VO methods have successfully learned new feature representations from images that are used to further improve the motion estimation.…”
Section: Introductionmentioning
confidence: 99%