2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016
DOI: 10.1109/cvpr.2016.438
|View full text |Cite
|
Sign up to set email alerts
|

A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

Abstract: Recent work has shown that optical flow estimation can be formulated as a supervised learning task and can be successfully solved with convolutional networks. Training of the so-called FlowNet was enabled by a large synthetically generated dataset. The present paper extends the concept of optical flow estimation via convolutional networks to disparity and scene flow estimation. To this end, we propose three synthetic stereo video datasets with sufficient realism, variation, and size to successfully train large… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

7
2,506
0
12

Year Published

2017
2017
2023
2023

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 2,343 publications
(2,525 citation statements)
references
References 30 publications
7
2,506
0
12
Order By: Relevance
“…Our full-frame Coordinate CNN is based on the architecture of DispNet [20]. It is fully convolutional [16] such that dense per-pixel scene coordinate predictions can be generated from arbitrary-sized input images.…”
Section: A Network Architecturementioning
confidence: 99%
See 1 more Smart Citation
“…Our full-frame Coordinate CNN is based on the architecture of DispNet [20]. It is fully convolutional [16] such that dense per-pixel scene coordinate predictions can be generated from arbitrary-sized input images.…”
Section: A Network Architecturementioning
confidence: 99%
“…Moreover, shortcut connections are added in between to overcome the data bottleneck. Unlike DispNet [20], there is only one final output layer at the end of the network and no multi-scale side predictions are used. Instead of ReLU [22], we use ELU [5] for the nonlinearity between layers.…”
Section: A Network Architecturementioning
confidence: 99%
“…Some algorithms use deep convolutional networks to learn the matching cost [25,26], and perform cost aggregation and optimization using other techniques.…”
Section: Supervised Techniquesmentioning
confidence: 99%
“…In addition to the matching cost estimation network, another convolutional neural network is also undertaken for obtain- ing the disparity map in place of winner-takes-all (WTA) strategy [7]. A large synthetic dataset is rendered to train an end-to-end network with images rather than small patches as input [8]. The attained disparity maps does not achieve state-of-the-art performance, yet it is able to recover occlusions, where most patch-based networks fail.…”
Section: Introductionmentioning
confidence: 99%