2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
DOI: 10.1109/cvpr.2017.159
|View full text |Cite
|
Sign up to set email alerts
|

End-to-End Training of Hybrid CNN-CRF Models for Stereo

Abstract: We propose a novel and principled hybrid CNN+CRF model for stereo estimation. Our model allows to exploit the advantages of both, convolutional neural networks (CNNs) and conditional random fields (CRFs) in an unified approach. The CNNs compute expressive features for matching and distinctive color edges, which in turn are used to compute the unary and binary costs of the CRF. For inference, we apply a recently proposed highly parallel dual block descent algorithm which only needs a small fixed number of itera… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
77
0

Year Published

2018
2018
2019
2019

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 128 publications
(77 citation statements)
references
References 44 publications
0
77
0
Order By: Relevance
“…Several techniques have been developed in the field of structured support vector machines (SSVMs) [92,28,1,95] that are very relevant to the task of learning energy models, as SSVMs can be understood as bi-level problems with a lower-level energy that is linear in θ and often a noncontinuous higher-level loss. Various strategies such as margin rescaling [92], slack rescaling [95,97], softmaxmargins [40] exist and have also been applied recently in the training of computer vision models in [54,29], we will later return to their connection to the investigated strategies.…”
Section: Related Workmentioning
confidence: 99%
“…Several techniques have been developed in the field of structured support vector machines (SSVMs) [92,28,1,95] that are very relevant to the task of learning energy models, as SSVMs can be understood as bi-level problems with a lower-level energy that is linear in θ and often a noncontinuous higher-level loss. Various strategies such as margin rescaling [92], slack rescaling [95,97], softmaxmargins [40] exist and have also been applied recently in the training of computer vision models in [54,29], we will later return to their connection to the investigated strategies.…”
Section: Related Workmentioning
confidence: 99%
“…(ii) This positive trend is transferred to the test set for the average error and the RMS error. (iii) The bad{0.5, 1} errors on the test set are reduced and (iv) the bad{2, 4} errors slightly increase on the test set compared to [11]. One reason for this is the limited amount of training data for these very high-resolution images.…”
Section: Benchmark Performancementioning
confidence: 99%
“…We use our method on top of the CNN-CRF [11] stereo method for the official test set evaluation (see Table 2). We set the temperature parameter η = 0.075 in all experiments.…”
Section: Benchmark Performancementioning
confidence: 99%
“…Supervised learning techniques include models which are trained on stereo images, but which can infer depth maps on monocular images. [12] propose an approach which follows this paradigm. They use the correlations of CNN feature maps of stereo images and derive the unary matching costs.…”
Section: Related Workmentioning
confidence: 99%