2018
DOI: 10.1109/tro.2018.2853729
|View full text |Cite
|
Sign up to set email alerts
|

VINS-Mono: A Robust and Versatile Monocular Visual-Inertial State Estimator

Abstract: A monocular visual-inertial system (VINS), consisting of a camera and a low-cost inertial measurement unit (IMU), forms the minimum sensor suite for metric six degreesof-freedom (DOF) state estimation. However, the lack of direct distance measurement poses significant challenges in terms of IMU processing, estimator initialization, extrinsic calibration, and nonlinear optimization. In this work, we present VINS-Mono: a robust and versatile monocular visual-inertial state estimator. Our approach starts with a r… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
2,078
1
2

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 3,106 publications
(2,084 citation statements)
references
References 45 publications
3
2,078
1
2
Order By: Relevance
“…Such representations can be sparse [184], semidense [185], or fully dense [186]. While dense representations can be directly used for autonomous navigation [186] or geographical reference, sparse representations are often only used for state estimation [187] or collaborative control of robotic agents. Due to the fact that the environment is often only partially known or even totally unknown, mapping tasks are often coupled with localization (pose estimation) issues, which turn them into the classic simultaneous localization and mapping (SLAM) problem.…”
Section: Cooperative Aerial Mappingmentioning
confidence: 99%
“…Such representations can be sparse [184], semidense [185], or fully dense [186]. While dense representations can be directly used for autonomous navigation [186] or geographical reference, sparse representations are often only used for state estimation [187] or collaborative control of robotic agents. Due to the fact that the environment is often only partially known or even totally unknown, mapping tasks are often coupled with localization (pose estimation) issues, which turn them into the classic simultaneous localization and mapping (SLAM) problem.…”
Section: Cooperative Aerial Mappingmentioning
confidence: 99%
“…Among the methods evaluated in this paper, the recently proposed VINS-Mono pipeline [4] compares to OKVIS, but only on a few of the EuRoC datasets. In [2], OKVIS was compared to a non-public implementation of MSCKF [1] on non-public datasets.…”
Section: A Related Workmentioning
confidence: 99%
“…The software is available in both a ROScompatible PC version and an iOS implementation for state estimation on mobile devices. 4 …”
Section: Roviomentioning
confidence: 99%
See 1 more Smart Citation
“…The quadrotor is equipped with dual fisheye cameras with one upward facing and another downward facing. The localization is carried out by visual-inertial fusion (Qin, Li, & Shen, 2018) in the onboard CPU, and the depths of each pixel in images are estimated using stereo matching in an additional graphics processing unit (GPU). Compared with the above-mentioned LiDAR-based mapping system, the pixelwise depth estimation by the fisheye cameras is significantly poorer and has a nonnegligible latency.…”
Section: Test With a Degraded Sensormentioning
confidence: 99%