2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.00136
|View full text |Cite
|
Sign up to set email alerts
|

D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
258
0
4

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 362 publications
(262 citation statements)
references
References 57 publications
0
258
0
4
Order By: Relevance
“…The aforementioned camera pose estimation is also related to visual odometry (VO) and visual-inertial odometry (VIO) [133], [134], which aim to calculate sequential camera poses of an agent based on the camera and Inertial Measurement Unit (IMU) sensors. VO and VIO are always used ad the front-end in a SLAM system, where the back-end refers to the nonlinear optimization of the pose graph, aiming to obtain globally consistent and drift-free pose estimation results.…”
Section: ) Text Spottingmentioning
confidence: 99%
“…The aforementioned camera pose estimation is also related to visual odometry (VO) and visual-inertial odometry (VIO) [133], [134], which aim to calculate sequential camera poses of an agent based on the camera and Inertial Measurement Unit (IMU) sensors. VO and VIO are always used ad the front-end in a SLAM system, where the back-end refers to the nonlinear optimization of the pose graph, aiming to obtain globally consistent and drift-free pose estimation results.…”
Section: ) Text Spottingmentioning
confidence: 99%
“…Three key advantages to using deep learning methods in mobile robot systems are [19]: data without much mathematical modeling for different conditions such as abnormal lighting, blur motion, or cases where the camera needs calibration by hand, which is hard to do [287]. These models are able to match in some ways human-level reasoning and apply it in methods such as SLAM to create semantic maps [288], [289], or taking visual odometry to the next level [290], [291].…”
Section: Integration Of Deep Learning In Mobile Robotsmentioning
confidence: 99%
“…Hybrid methods combining classical models and deep learning approaches have seen a higher accuracy in terms of performance. Some hybrid algorithms like D3VO [291]) even outperform VIO algorithms such as DeepVIO [309], VIOLearner [307] as well as classical VO systems such as DSO [310], ORB SLAM [298] in KTTI benchmarks.…”
Section: A Odometrymentioning
confidence: 99%
“…A common assumption used by current works is photometric consistency, that is, the photometric error of corresponding pixel of the same object in different frames is zero. The photometric consistency assumption is often not satisfied because of brightness change and non-Lambertian surface [ 19 ]. To overcome these issues, GeoNet [ 11 ] added structural similarity (SSIM) [ 20 ] to loss to mitigate the effects of brightness change.…”
Section: Introductionmentioning
confidence: 99%
“…SSIM captures more local information than SAD, but it does not capture global information. D3VO [ 19 ] predicted the global transformation parameters through a network, and adjusts the image I to . However, D3VO only pays attention to the global brightness change, which is often hard to be satisfied in the real scene.…”
Section: Introductionmentioning
confidence: 99%