AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach

Song, Xiaoning; Yang, Gang; Zhu, Xinge; Zhou, Hui; Ma, Yuexin; Wang, Zhe; Shi, Jianping

doi:10.1007/s11263-021-01549-6

Cited by 12 publications

(2 citation statements)

References 94 publications

(141 reference statements)


“…Specifically, compared to our pre-trained model UCFNet_pretrain, UCFNet_adapt achieves 37.78%, 40.38%, 20%, 37.5% error reduction on KITTI2012, KITTI2015, Middlebury, and ETH3D, respectively. Moreover, compared to the current best-published domain adaptation method AdaStereo [52], [53], our method can still outperform it on three of four datasets, which further proves the effectiveness of the proposed method. Note that our method doesn't employ the non-adversarial progressive color transfer and cost normalization proposed in AdaStereo, thus, the performance of our method has the potential for further improvement.…”

Section: Robustness Evaluation (supporting)

confidence: 52%

“…Threshold δ of the ground truth uncertainty mask is 1. Asymmetric chromatic augmentation and asymmetric occlusion [64] [59] synthetic+TDD(no gt) 8.6 7.8 --ZOLE [38] synthetic+TDD(no gt) -6.8 --MADNet [57] synthetic+TDD(no gt) 9.3 8.5 --AdaStereo [52], [53] synthetic+TDD(no gt) epochs with a learning rate of 0.0001. The core idea of our three-stage finetune strategy is to prevent the small datasets from being overwhelmed by large datasets.…”

Section: Implementation Details (mentioning)

confidence: 99%


CFNet: Cascade and Fused Cost Volume for Robust Stereo Matching

Shen; Dai; Rao

2021

2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

223


Due to domain differences and unbalanced disparity distributions across datasets, current stereo matching approaches are commonly limited to a specific dataset and generalize poorly to others. This domain shift is usually addressed by substantial adaptation on costly target-domain ground-truth data, which cannot be easily obtained in practical settings. In this paper, we propose to dig into uncertainty estimation for robust stereo matching. Specifically, to balance the disparity distribution, we employ pixel-level uncertainty estimation to adaptively adjust the disparity search space of the next stage, thereby driving the network to progressively prune the space of unlikely correspondences. Then, to address the limited ground-truth data, an uncertainty-based pseudo-label is proposed to adapt the pre-trained model to the new domain, where pixel-level and area-level uncertainty estimation are used to filter out the high-uncertainty pixels of predicted disparity maps and generate sparse yet reliable pseudo-labels to bridge the domain gap. Experimentally, our method shows strong cross-domain, adaptation, and joint generalization, and obtains 1st place on the stereo task of Robust Vision Challenge 2020. Additionally, our uncertainty-based pseudo-labels can be extended to train monocular depth estimation networks in an unsupervised way and even achieve performance comparable to supervised methods. The code will be available at https://github.com/gallenszl/UCFNet.
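The pixel-level filtering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that uncertainty is measured as the variance of each pixel's disparity probability distribution, and the function names, the `max_uncertainty` threshold, and the use of `None` for discarded pixels are all hypothetical. The area-level filtering mentioned in the abstract is not covered.

```python
from typing import List, Optional, Sequence, Tuple


def disparity_and_uncertainty(
    probs: Sequence[float], candidates: Sequence[float]
) -> Tuple[float, float]:
    """Return the expected disparity and its variance for one pixel.

    Assumption: `probs` is a normalized distribution over `candidates`,
    and the variance is used as the uncertainty measure.
    """
    mean = sum(p * d for p, d in zip(probs, candidates))
    variance = sum(p * (d - mean) ** 2 for p, d in zip(probs, candidates))
    return mean, variance


def filter_pseudo_labels(
    prob_volume: Sequence[Sequence[Sequence[float]]],
    candidates: Sequence[float],
    max_uncertainty: float,
) -> List[List[Optional[float]]]:
    """Keep a predicted disparity only where its uncertainty is low.

    `prob_volume[y][x]` holds the disparity distribution of pixel (x, y).
    Pixels above `max_uncertainty` (a hypothetical threshold) become None,
    which yields a sparse pseudo-label map.
    """
    labels: List[List[Optional[float]]] = []
    for row in prob_volume:
        label_row: List[Optional[float]] = []
        for probs in row:
            disparity, uncertainty = disparity_and_uncertainty(probs, candidates)
            label_row.append(disparity if uncertainty <= max_uncertainty else None)
        labels.append(label_row)
    return labels


# One row, two pixels: a confident (unimodal) and an ambiguous (bimodal) pixel.
example = filter_pseudo_labels(
    [[[0.0, 1.0, 0.0], [0.5, 0.0, 0.5]]], candidates=[0, 1, 2], max_uncertainty=0.5
)
```

In this toy input both pixels have the same expected disparity, but only the unimodal one survives the filter, which is the sense in which the resulting pseudo-labels are sparse but reliable.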
