2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00340
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Level Context Ultra-Aggregation for Stereo Matching

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
56
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 111 publications
(56 citation statements)
references
References 21 publications
0
56
0
Order By: Relevance
“…Following this seminal work, [13] proposed an end-to-end architecture inspired by the classical stereo matching pipeline. Many works followed this trend [2,23,20,18,24,9,5] and completely outmatched previous methods in terms of accuracy. However, this gain in accuracy is at the cost of high computation requirement.…”
Section: Related Workmentioning
confidence: 95%
“…Following this seminal work, [13] proposed an end-to-end architecture inspired by the classical stereo matching pipeline. Many works followed this trend [2,23,20,18,24,9,5] and completely outmatched previous methods in terms of accuracy. However, this gain in accuracy is at the cost of high computation requirement.…”
Section: Related Workmentioning
confidence: 95%
“…Existing end-to-end disparity estimation networks usually include cost volume computation, cost aggregation, and disparity prediction. 2D CNN based methods [17,20,33] generally adopt a correlation layer for 3D cost volume construction, while 3D CNN based methods [2,3,8,22,38] mostly use direct feature concatenation to construct 4D cost volume and use 3D convolution for cost aggregation. Apart from supervised methods, several unsupervised learning methods [14,24,29,36,40] have been developed to avoid the use of costly ground truth depth annotations.…”
Section: Disparity Estimationmentioning
confidence: 99%
“…It plays a key role in a range of real-world applications, such as stereo matching [2], image understanding [3], co-saliency detection [4], action recognition [5], video detection and segmentation [6][7][8][9], semantic segmentation [10,11], medical image segmentation [12][13][14], object tracking [15,16], person re-identification [17,18], camouflaged object detection [19], image retrieval [20], etc. Although significant progress has been made in the salient object detection field over the past several years [21][22][23][24][25][26][27][28][29][30][31][32][33][34][35], there is still room for improvement when faced with challenging factors, such as complicated backgrounds or varying lighting conditions in the scenes.…”
Section: Introductionmentioning
confidence: 99%
“…(2) According to fusion model: it is critical to effectively fuse RGB and depth images in this task, so we review different fusion strategies to understand their effectiveness. (3) As single-or multi-stream models: using a single stream can reduce the number of parameters, but the final result may not be optimal; multiple streams may require more parameters.…”
mentioning
confidence: 99%