2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00931
|View full text |Cite
|
Sign up to set email alerts
|

PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume

Abstract: We present a compact but effective CNN model for optical flow, called PWC-Net. PWC-Net has been designed according to simple and well-established principles: pyramidal processing, warping, and the use of a cost volume. Cast in a learnable feature pyramid, PWC-Net uses the current optical flow estimate to warp the CNN features of the second image. It then uses the warped features and features of the first image to construct a cost volume, which is processed by a CNN to estimate the optical flow. PWC-Net is 17 t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

8
2,195
0
6

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 2,463 publications
(2,209 citation statements)
references
References 57 publications
8
2,195
0
6
Order By: Relevance
“…Learning from synthetic data. Training the model on large-scale synthetic datasets has been extensively studied in semantic segmentation [44,45,17,16,9,19,37,38,56], multi-view stereo [20], depth estimation [51], optical flow [43,21,23], amodal segmentation [18], and object detection [9,33]. In our work, we show that the proposed cross-domain consistency loss can be applied not only to synthetic-to-real adaptation but to real-to-real adaptation tasks as well.…”
Section: Related Workmentioning
confidence: 99%
“…Learning from synthetic data. Training the model on large-scale synthetic datasets has been extensively studied in semantic segmentation [44,45,17,16,9,19,37,38,56], multi-view stereo [20], depth estimation [51], optical flow [43,21,23], amodal segmentation [18], and object detection [9,33]. In our work, we show that the proposed cross-domain consistency loss can be applied not only to synthetic-to-real adaptation but to real-to-real adaptation tasks as well.…”
Section: Related Workmentioning
confidence: 99%
“…Since the source code for Super SloMo is not available, we implemented it ourselves. We replaced the flow computation network with PWC-Net [19] as it gives better performance, especially on videos with high resolution and large motion. For Gupta et al's approach, we use one of the scenes from their supplementary video for comparison.…”
Section: Resultsmentioning
confidence: 99%
“…For the deblurring and optical flow modules, we take advantage of existing neural network architectures which have performed well in the past for the respective supervised learning tasks [9], [17], [21], [30], [32]. In particular, we adopt the single image deblurring network from Tao et al [32] and the dense optical flow estimation network PWC-Net from Sun et al [30]. We make the following modifications for the deblurring network for our particular problem: 1) We replace the deconvolution layer with bilinear upsampling followed by a 3x3 convolution to avoid upsampling artifacts.…”
Section: A Deblurring and Optical Flowmentioning
confidence: 99%