2019 IEEE International Conference on Multimedia and Expo (ICME) 2019
DOI: 10.1109/icme.2019.00093
|View full text |Cite
|
Sign up to set email alerts
|

DISCO: Depth Inference from Stereo using Context

Abstract: Recent deep learning based approaches have outperformed classical stereo matching methods. However, current deep learning based end-to-end stereo matching methods adopt a generic encoder-decoder style network with skip connections. To limit computational requirement, many networks perform excessive down sampling, which results in significant loss of useful low-level information. Additionally, many network designs do not exploit the rich multi-scale contextual information. In this work, we address these aforeme… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 22 publications
0
3
0
Order By: Relevance
“…8 demonstrates the practical utility of AcED in the challenging single camera bokeh application. AcED was first trained using our in-house synthetic dataset [32] containing realistic human centric images with dense depth ground-truth. To reduce the computation load for this task, the light weight MobileNet V2 [33] model was employed as the backbone encoder network.…”
Section: Resultsmentioning
confidence: 99%
“…8 demonstrates the practical utility of AcED in the challenging single camera bokeh application. AcED was first trained using our in-house synthetic dataset [32] containing realistic human centric images with dense depth ground-truth. To reduce the computation load for this task, the light weight MobileNet V2 [33] model was employed as the backbone encoder network.…”
Section: Resultsmentioning
confidence: 99%
“…Current (as of December 2019) top 10 depth estimation methods tested in Middlebury Stereo Evaluation Version 3 [51] for the Teddy sequence achieve the average error smaller than 1.36 and the percentage of bad pixels smaller than 5.57% (E T = 4.0) (e.g. methods described in [72], [73] and [74], see Table 6). Therefore, the proposed method shows state-of-the-art results in terms of the depth maps accuracy.…”
Section: Tablementioning
confidence: 99%
“…Chang et al proposed a pyramid stereo matching network [7] comprising a spatial pyramid pooling module and a 3D CNN module. Kunal Swami et al [8] proposed an end-to-end network model to utilize rich multiscale context information, which most existing methods cannot achieve. A large effective receiving domain is implemented to extract multiscale context information, while retaining the required spatial information in the entire network.…”
Section: Introductionmentioning
confidence: 99%