When learning models for real-world robot spatial perception tasks, one might have access only to partial labels: this occurs, for example, in semi-supervised scenarios (in which labels are not available for a subset of the training instances) or in some types of self-supervised robot learning (where the robot autonomously acquires a labeled training set, but only acquires labels for a subset of the output variables in each instance). We introduce a general approach to this class of problems based on an auxiliary loss that enforces the expectation that the perceived environment state should not change abruptly over time; we then instantiate the approach on two robot perception problems: a simulated ground robot learning long-range obstacle mapping as a 400-binary-label classification task in a self-supervised way in a static environment, and a real nano-quadrotor learning human pose estimation as a 3-variable regression task in a semi-supervised way in a dynamic environment. In both cases, our approach yields significant quantitative performance improvements over baselines: an average increase of 6 AUC percentage points in the former, and a relative improvement of the R² metric ranging from 7% to 33% in the latter.
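To make the idea concrete, below is a minimal sketch (not the authors' code) of how such a temporal-consistency auxiliary loss could be combined with a partially supervised loss in PyTorch; the names `model`, `frames_t`, `frames_t1`, `labels`, `label_mask`, and the weight `lambda_smooth` are illustrative assumptions, and the regression case is shown (a classification instance would swap in a cross-entropy term).

```python
import torch

def total_loss(model, frames_t, frames_t1, labels, label_mask, lambda_smooth=0.1):
    """Partially supervised loss plus a temporal smoothness penalty.

    frames_t, frames_t1: observations at consecutive time steps.
    labels: target outputs (only partially valid).
    label_mask: 1 where a label is available, 0 otherwise.
    """
    pred_t = model(frames_t)    # perceived state at time t
    pred_t1 = model(frames_t1)  # perceived state at time t+1

    # Supervised term, averaged only over the outputs that have labels.
    sq_err = (pred_t - labels) ** 2 * label_mask
    sup = sq_err.sum() / label_mask.sum().clamp(min=1)

    # Auxiliary term: the perceived environment state should not
    # change abruptly between consecutive time steps.
    smooth = ((pred_t - pred_t1) ** 2).mean()

    return sup + lambda_smooth * smooth
```

In this sketch, the auxiliary term is computed on all outputs, labeled or not, which is what lets the unlabeled output variables contribute to training.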