2019
DOI: 10.1007/978-3-030-11009-3_20
|View full text |Cite
|
Sign up to set email alerts
|

Generative Adversarial Networks for Unsupervised Monocular Depth Prediction

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
76
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
4
1

Relationship

2
8

Authors

Journals

citations
Cited by 82 publications
(77 citation statements)
references
References 43 publications
0
76
0
1
Order By: Relevance
“…Concerning stereo supervision, Garg et al [14] first followed this approach, while Godard et al [16] introduced a leftright consistency loss. Other methods improved efficiency [35], deploying a pyramidal architecture, and accuracy by simulating a trinocular setup [36], including joint semantic segmentation [49] or adding adversarial term [1,6]. In [33], a strategy was proposed to reduce further the energy efficiency of [35] leveraging fixed-point quantization.…”
Section: Related Workmentioning
confidence: 99%
“…Concerning stereo supervision, Garg et al [14] first followed this approach, while Godard et al [16] introduced a leftright consistency loss. Other methods improved efficiency [35], deploying a pyramidal architecture, and accuracy by simulating a trinocular setup [36], including joint semantic segmentation [49] or adding adversarial term [1,6]. In [33], a strategy was proposed to reduce further the energy efficiency of [35] leveraging fixed-point quantization.…”
Section: Related Workmentioning
confidence: 99%
“…3.3) by applying it to [14] and improving their model. Our full model using a generic encoder-decoder outperforms all variants on every metric, including [2] which predicts disparities that generate photo-realistic images. Our full model using our proposed two-branch decoder (*) further improves the state-of-the-art.…”
Section: Kitti Eigen Splitmentioning
confidence: 99%
“…Over the last years, convolutional neural networks (CNNs)s have been shown to be well-suited for both SfM, multi-view and monocular approaches, and they can be trained supervised by large RGB-D indoor datasets acquired by consumer ToF cameras. Since the acquisition of ground truth depth in large outdoor environments is challenging, semi-supervised [27] or even self-supervised approaches [2,12,15,41] have been proposed, tackling this challenge by solving proxy tasks such as stereo matching.…”
Section: Related Workmentioning
confidence: 99%