2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2018
DOI: 10.1109/iros.2018.8593748
|View full text |Cite
|
Sign up to set email alerts
|

Estimating Metric Poses of Dynamic Objects Using Monocular Visual-Inertial Fusion

Abstract: A monocular 3D object tracking system generally has only up-to-scale pose estimation results without any prior knowledge of the tracked object. In this paper, we propose a novel idea to recover the metric scale of an arbitrary dynamic object by optimizing the trajectory of the objects in the world frame, without motion assumptions. By introducing an additional constraint in the time domain, our monocular visualinertial tracking system can obtain continuous six degree of freedom (6-DoF) pose estimation without … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
3
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 21 publications
1
3
0
Order By: Relevance
“…The architecture for "model-based tracking" enables the estimation of 3D object detection, but to our knowledge, no work has been done in this field to link two autonomous systems such as a CNN and VINS altogether. Most of these methods use visual features such as ORB [28] or region-based visual bundle adjustment where the first detection is supposed to be correct as in [19]. This paper is very close from our work in terms for 2D-3D matching.…”
Section: B Tracking Systemssupporting
confidence: 64%
See 1 more Smart Citation
“…The architecture for "model-based tracking" enables the estimation of 3D object detection, but to our knowledge, no work has been done in this field to link two autonomous systems such as a CNN and VINS altogether. Most of these methods use visual features such as ORB [28] or region-based visual bundle adjustment where the first detection is supposed to be correct as in [19]. This paper is very close from our work in terms for 2D-3D matching.…”
Section: B Tracking Systemssupporting
confidence: 64%
“…Between tracking and mapping field of research, object pose estimation has also been researched with [14] [19] or without neural networks [7]. However, this latter method were not developed in the perspective of a robotic object detection but rather for environment mapping like Semantic SLAM does.…”
Section: Related Work a Object Detection In 3dmentioning
confidence: 99%
“…However, the improvement of the performance of the VINS through the M-estimator is limited in dynamic scenarios. Similar research was extended in [39,40] and the same framework was used. The M-estimator was applied to increase the robustness of the standard error function and improved performance was obtained.…”
Section: Introductionmentioning
confidence: 99%
“…However, the improvement of the performance of the VINS based on the M-estimator is limited in dynamic scenarios. In fact, similar researches are extended in [37,38], and the same framework is used. M-estimator is applied to increase the robustness of the standard error function, and the improved performance is obtained.…”
Section: Introductionmentioning
confidence: 99%