2019 International Conference on Robotics and Automation (ICRA) 2019
DOI: 10.1109/icra.2019.8794377
|View full text |Cite
|
Sign up to set email alerts
|

Vision-based Control of a Quadrotor in User Proximity: Mediated vs End-to-End Learning Approaches

Abstract: We consider the task of controlling a quadrotor to hover in front of a freely moving user, using input data from an onboard camera. On this specific task we compare two widespread learning paradigms: a mediated approach, which learns a high-level state from the input and then uses it for deriving control signals; and an end-to-end approach, which skips high-level state estimation altogether. We show that despite their fundamental difference, both approaches yield equivalent performance on this task. We finally… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
21
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
3

Relationship

2
4

Authors

Journals

citations
Cited by 9 publications
(21 citation statements)
references
References 29 publications
0
21
0
Order By: Relevance
“…Semi-supervised estimation of user pose in a nano-drone: user's head location (x, y, z) and heading (φ) predictions. Compared to the model trained using only the task loss [26] (blue), our approach with ∆t = 0.5s (red) is usually closer to the ground-truth (green). Videos available in supplementary material.…”
Section: Resultsmentioning
confidence: 92%
See 4 more Smart Citations
“…Semi-supervised estimation of user pose in a nano-drone: user's head location (x, y, z) and heading (φ) predictions. Compared to the model trained using only the task loss [26] (blue), our approach with ∆t = 0.5s (red) is usually closer to the ground-truth (green). Videos available in supplementary material.…”
Section: Resultsmentioning
confidence: 92%
“…B. Semi-supervised estimation of user pose in a nano-drone 1) Quantitative analysis: On the held-out testing set, we train the proposed approach using ∆t ∈ {0.5, 1.0, 1.5, 2.0} seconds; we compare the results of these models with two alternatives: one baseline model trained with λ sc = 0 (equivalent to using only labeled data as in [26]); and a model acting as Fig. 5.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations