European Conference on Visual Media Production 2020
DOI: 10.1145/3429341.3429355
|View full text |Cite
|
Sign up to set email alerts
|

Constant Velocity Constraints for Self-Supervised Monocular Depth Estimation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
20
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 20 publications
(20 citation statements)
references
References 29 publications
0
20
0
Order By: Relevance
“…Supervised depth estimation has a significant body of work [19], [20], but requires pixel-wise depth labeling. Selfsupervised learning has been applied to monocular depth estimation, achieving better performance than supervised methods [21], [22]. Further research incorporates semantic information [23], [24], but monocular depth estimation remains challenging due to its ill-posed nature.…”
Section: B Self-supervised Depth Estimationmentioning
confidence: 99%
“…Supervised depth estimation has a significant body of work [19], [20], but requires pixel-wise depth labeling. Selfsupervised learning has been applied to monocular depth estimation, achieving better performance than supervised methods [21], [22]. Further research incorporates semantic information [23], [24], but monocular depth estimation remains challenging due to its ill-posed nature.…”
Section: B Self-supervised Depth Estimationmentioning
confidence: 99%
“…Furthermore, due to the ambiguous nature of photometric loss, the depth can only be predicted up to an unknown scale factor. Since then, a number of works [2,17,19,20,37,43,45,64,75,82,83,86] advanced the field considerably. For example, Godard et al [17] propose to upsample the multi-scale depth maps before loss calculation and use the minimum photometric error to tackle occlusions.…”
Section: Self-supervised Monocular Depth Estimationmentioning
confidence: 99%
“…The generated depth images are much more accurate, but very sparse. Therefore, recent methods follow a self-supervised training strategy using either stereo images [14,16], video sequences [2,19,20,45,64,75,[82][83][84]86], or both [17,37,43] during training. The training objective is formulated as an image synthesis problem based on geometric constraints.…”
Section: Introductionmentioning
confidence: 99%
“…Supervised depth estimation approaches [1,5,8,10,31,35,38] can predict dense depth maps but require costly labelled depth ground truth. In contrast, self-supervised approaches require no labelled data [12,13,23,30,34,39,41,[46][47][48][49] and are performing competitively at this task. At a high level, self-supervised depth estimation approaches use a depth networks' output as an intermediate representation for a stereo matching problem or an image reconstruction task.…”
Section: Introductionmentioning
confidence: 99%
“…At a high level, self-supervised depth estimation approaches use a depth networks' output as an intermediate representation for a stereo matching problem or an image reconstruction task. For the latter, these approaches [13,34,43,[47][48][49] are trained with a selfsupervised monocular depth estimation (SDE) framework.…”
Section: Introductionmentioning
confidence: 99%