CFNet: Cascade and Fused Cost Volume for Robust Stereo Matching

Shen, Zhelun; Dai, Yuchao; Rao, Zhibo

doi:10.1109/cvpr46437.2021.01369

Cited by 225 publications

(87 citation statements)

References 62 publications

“…ETH3D. As shown in Table 5, our ACVNet outperforms the state-of-the-art methods, HITNet [18] and CFNet [16].…”

Section: KITTI (mentioning)

confidence: 77%

“…Specifically, we denote the model after applying our method as GwcNet-ACV, PSMNet-ACV and CFNet-ACV for comparison respectively. As shown in is reduced by 39.5% for GwcNet [7], 42.2% for PSMNet [2] and 14.4% for CFNet [16]. We experimentally compare our ACV with cascaded approaches.…”

Section: Universality and Superiority of ACV (mentioning)

confidence: 98%

“…Following the 4D combined cost volume, cascade cost volumes [6,16,19] further reduce the memory and computational complexity of cost volume construction by building a cost volume pyramid in a coarse-to-fine manner to progressively narrow down the target disparity range and refine the depth map. However, such coarse-to-fine strategy inevitably involves accumulated errors, i.e.…”

Section: Related Work (mentioning)

confidence: 99%
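
The coarse-to-fine narrowing described in the statement above can be illustrated with a minimal sketch. It assumes a fixed search radius and uniform sampling, which is a simplification chosen for illustration and not the scheme of any cited paper; the function names and the numeric values are hypothetical.

```python
def upsample_row(coarse_row, scale):
    """Nearest-neighbour upsampling of one row of disparities.

    Disparity is measured in pixels, so each value is multiplied by
    `scale` when the image width grows by that factor.
    """
    return [d * scale for d in coarse_row for _ in range(scale)]


def narrowed_hypotheses(center, radius, step, max_disp):
    """Candidates in [center - radius, center + radius], clipped to [0, max_disp]."""
    n = int(round(2 * radius / step)) + 1
    candidates = [center - radius + i * step for i in range(n)]
    return [d for d in candidates if 0 <= d <= max_disp]


def reachable(true_disp, center, radius):
    """True if the finer stage can still select the true disparity."""
    return abs(true_disp - center) <= radius


coarse_row = [3.0, 10.0]  # made-up disparities at 1/4 resolution
fine_row = upsample_row(coarse_row, scale=4)
hyps = [narrowed_hypotheses(c, radius=2, step=1, max_disp=192) for c in fine_row]

print(fine_row)                    # upsampled centers for the finer stage
print(hyps[0])                     # 5 candidates instead of the full range
print(reachable(20.0, 12.0, 2))    # a coarse error larger than the radius is not recoverable
```

The last call shows the accumulated-error issue the statement mentions: once the coarse estimate is off by more than the search radius, the true disparity is no longer among the candidates of the finer stage.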

“…mp denotes multi-level adaptive patch (Figure 3) which has different rates and adaptive weights for three-level feature maps.

Model         D1 (%)   EPE (px)
PSMNet [2]    3.89     1.09
PSMNet-ACV    2.17     0.63
GwcNet [7]    2.71     0.76
GwcNet-ACV    1.55     0.46
CFNet [16]    4.51     0.97
CFNet-ACV     4.02     0.83

Table 3. Universality study of ACV on Scene Flow [12].…”

Section: Implementation Details (mentioning)

confidence: 99%
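
As a quick arithmetic check, the percentages quoted in the truncated statement above (39.5%, 42.2%, 14.4%) appear consistent with the relative EPE reductions implied by the Table 3 values. The statement is cut off, so reading them as EPE reductions is an inference from the numbers, not something the excerpt states.

```python
# EPE (px) before and after adding ACV, copied from the quoted Table 3.
epe = {
    "GwcNet": (0.76, 0.46),
    "PSMNet": (1.09, 0.63),
    "CFNet": (0.97, 0.83),
}


def relative_reduction(before, after):
    """Relative reduction in percent."""
    return 100.0 * (before - after) / before


reductions = {
    name: round(relative_reduction(before, after), 1)
    for name, (before, after) in epe.items()
}
print(reductions)
```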

“…To demonstrate the universality of our ACV, we integrate our ACV into three state-of-the-art models, i.e. GwcNet [7], PSMNet [2] and CFNet [16], and compare the performance of the original models with those after using our ACV. Specifically, we denote the model after applying our method as GwcNet-ACV, PSMNet-ACV and CFNet-ACV for comparison respectively.…”

Section: Universality and Superiority of ACV (mentioning)

confidence: 99%

Attention Concatenation Volume for Accurate and Efficient Stereo Matching

Xu, Cheng, Guo et al. 2022

Preprint

Stereo matching is a fundamental building block for many vision and robotics applications. An informative and concise cost volume representation is vital for stereo matching of high accuracy and efficiency. In this paper, we present a novel cost volume construction method which generates attention weights from correlation clues to suppress redundant information and enhance matching-related information in the concatenation volume. To generate reliable attention weights, we propose multi-level adaptive patch matching to improve the distinctiveness of the matching cost at different disparities, even for textureless regions. The proposed cost volume, named attention concatenation volume (ACV), can be seamlessly embedded into most stereo matching networks; the resulting networks can use a more lightweight aggregation network while achieving higher accuracy, e.g., GwcNet achieves higher accuracy using only 1/25 of the parameters of its aggregation network. Furthermore, we design a highly accurate network (ACVNet) based on our ACV, which achieves state-of-the-art performance on several benchmarks. The code is available at https://github.com/gangweiX/ACVNet.
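
The idea of weighting a concatenation volume by correlation-derived attention can be sketched on a single image row. This is a toy illustration only: the dot-product correlation, the softmax over disparities, the zero padding, and all names below are assumptions made for the sketch, and it omits the multi-level adaptive patch matching the abstract describes. The authors' implementation is in the repository linked above.

```python
import math


def correlation(f_left, f_right, x, d):
    """Dot-product similarity between left pixel x and right pixel x - d."""
    if x - d < 0:
        return 0.0  # assumption: out-of-image candidates get a zero score
    return sum(a * b for a, b in zip(f_left[x], f_right[x - d]))


def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_concat_row(f_left, f_right, max_disp):
    """volume[x][d] = attention weight * concat(left feature, shifted right feature)."""
    channels = len(f_left[0])
    volume = []
    for x in range(len(f_left)):
        weights = softmax([correlation(f_left, f_right, x, d) for d in range(max_disp)])
        per_disp = []
        for d, w in enumerate(weights):
            right = f_right[x - d] if x - d >= 0 else [0.0] * channels
            per_disp.append([w * v for v in f_left[x] + right])
        volume.append(per_disp)
    return volume


# Toy row: 4 pixels, 2 channels; the right view is the left view shifted by one pixel.
f_left = [[0.0, 1.0], [4.0, 0.0], [0.0, 4.0], [4.0, 4.0]]
f_right = f_left[1:] + [[0.0, 0.0]]
volume = attention_concat_row(f_left, f_right, max_disp=3)

# Disparity whose weighted entry is largest at pixel 3.
best = max(range(3), key=lambda d: volume[3][d][0])
print(best)
```

In this toy row the entries at the true disparity keep almost all of their magnitude while the other disparities are scaled toward zero, which is the suppress-and-enhance effect the abstract attributes to the attention weights.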

AAEE‐Net: Attention‐guided aggregation and error‐aware enhancement network for accurate and efficient stereo matching

Liu, Zhang et al. 2023

Concurrency and Computation

Stereo matching is a fundamental and long‐standing task in computer vision. Although learning‐based stereo matching algorithms have made remarkable progress, two major challenges persist. First, existing cost aggregation methods that use stacked three‐dimensional convolutions are complex, leading to heavy computation and memory costs. Second, these methods continue to struggle to establish reliable matches in weakly matchable regions such as edges and thin structures. To overcome these limitations, we propose an accurate and efficient network called Attention‐guided Aggregation and Error‐aware Enhancement Network (AAEE‐Net). Our approach involves designing an Attention‐guided Aggregation Mechanism (AAM) based on simple image features. This mechanism uses attention weights generated from image features to guide cost aggregation with a more efficient and effective strategy. Additionally, we propose an Error‐aware Enhancement Module (EEM) that refines the raw disparity by combining high‐frequency information from the original image and the warp error between the left and right views. EEM enables the network to learn error correction capabilities that produce excellent subtle details and sharp edges. The experimental results on the SceneFlow and KITTI benchmark datasets demonstrate that AAEE‐Net achieves state‐of‐the‐art performance with low inference time. The qualitative results show that AAEE‐Net significantly improves predictions, especially for thin structures.
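
The warp error mentioned in the abstract can be illustrated on one row of intensities. The sketch below assumes nearest-pixel sampling and an absolute-difference error; it is not the EEM module itself, and the function names and pixel values are hypothetical.

```python
def warp_right_to_left(right_row, disparity_row):
    """Sample right_row at x - d (nearest pixel); out-of-bounds samples become None."""
    warped = []
    for x, d in enumerate(disparity_row):
        src = x - int(round(d))
        warped.append(right_row[src] if 0 <= src < len(right_row) else None)
    return warped


def warp_error(left_row, right_row, disparity_row):
    """Absolute difference between the left row and the warped right row."""
    warped = warp_right_to_left(right_row, disparity_row)
    return [abs(l - w) if w is not None else None for l, w in zip(left_row, warped)]


left = [10, 20, 30, 40, 50]
right = [20, 30, 40, 50, 0]  # left row shifted by one pixel (true disparity 1)

good = warp_error(left, right, [1, 1, 1, 1, 1])
bad = warp_error(left, right, [1, 1, 2, 1, 1])  # wrong disparity at x = 2
print(good)
print(bad)
```

A correct disparity yields zero error wherever the sample is valid, while the wrong estimate at x = 2 produces a non-zero residual; a refinement stage can use that residual as a cue for where the raw disparity needs correction.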

Self-adapting Confidence Estimation for Stereo

Poggi, Aleotti, Tosi et al. 2020

Lecture Notes in Computer Science

Project page: https://fedstereo.github.io/

[Figure 1 panel labels, in extraction order: ~80 FPS; 13.86% D1-all; 9.16% D1-all; 8.36% D1-all; ~10 FPS; ~80 FPS]

Figure 1. Federated adaptation in challenging environments. When facing a domain very different from those observed during training, e.g., nighttime images (a), stereo models [55] suffer drops in accuracy (b). By enabling online adaptation [41] (c), the network can improve its predictions, at the expense of decimating the framerate. In our federated framework, the model can delegate the adaptation process to the cloud, enjoying its benefits while maintaining the original processing speed (d).
