2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00339
|View full text |Cite
|
Sign up to set email alerts
|

Group-Wise Correlation Stereo Network

Abstract: Stereo matching estimates the disparity between a rectified image pair, which is of great importance to depth sensing, autonomous driving, and other related tasks. Previous works built cost volumes with cross-correlation or concatenation of left and right features across all disparity levels, and then a 2D or 3D convolutional neural network is utilized to regress the disparity maps. In this paper, we propose to construct the cost volume by group-wise correlation. The left features and the right features are di… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
445
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 573 publications
(482 citation statements)
references
References 36 publications
2
445
0
Order By: Relevance
“…On Tab. 1 we report our result compared to other (published) fast inference architectures on the leaderboard (runtime measured on NVIDIA 1080Ti) as well as with a slower and more accurate one, GWCNet [11]. At the time of writing, our method ranks 90 th .…”
Section: Madnet Performancementioning
confidence: 96%
See 1 more Smart Citation
“…On Tab. 1 we report our result compared to other (published) fast inference architectures on the leaderboard (runtime measured on NVIDIA 1080Ti) as well as with a slower and more accurate one, GWCNet [11]. At the time of writing, our method ranks 90 th .…”
Section: Madnet Performancementioning
confidence: 96%
“…Although not achieving state-ofthe-art accuracy, this seminal work turned out quite disruptive compared to the traditional stereo paradigm outlined in [35], highlighting the potential for a totally new approach. Thereby, [21] ignited the spread of end-to-end stereo architectures [17,24,19,4,16,11] that quickly outmatched any other technique on the KITTI benchmarks by leveraging on a peculiar training protocol. In particular, the deep network is initially trained on a large amount of synthetic data with groundtruth labels [21] and then fine-tuned on the target domain (e.g., KITTI) based on stereo pairs with groundtruth.…”
Section: Related Workmentioning
confidence: 99%
“…All pixels Non-occluded pixels Runtime (s) Environment D1-bg (%) D1-fg (%) D1-all (%) D1-bg (%) D1-fg (%) D1-all (%) PSMNet [28] 1 [31] and sparse technique [5], to address this problem, there is still a long way to go before a practical solution is developed. And of course, this kind of end-to-end stereo matching network needs corresponding ground truth depth data for training, which is a challenging problem.…”
Section: Methodsmentioning
confidence: 99%
“…Tons of algorithms based on this have been proposed. ese methods could roughly be categorized into two groups: 2D encode-decoder structures [23][24][25][26][27] and regularization modules composed of 3D convolutions [28][29][30][31]. DispNetC [24] computes a correlation volume from the left and right image features (encoding) and utilizes a CNN to directly regress (decoding) disparity maps.…”
Section: Introductionmentioning
confidence: 99%
“…Following this seminal work, [13] proposed an end-to-end architecture inspired by the classical stereo matching pipeline. Many works followed this trend [2,23,20,18,24,9,5] and completely outmatched previous methods in terms of accuracy. However, this gain in accuracy is at the cost of high computation requirement.…”
Section: Related Workmentioning
confidence: 95%