2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00190
|View full text |Cite
|
Sign up to set email alerts
|

AutoDispNet: Improving Disparity Estimation With AutoML

Abstract: Much research work in computer vision is being spent on optimizing existing network architectures to obtain a few more percentage points on benchmarks. Recent AutoML approaches promise to relieve us from this effort. However, they are mainly designed for comparatively smallscale classification tasks. In this work, we show how to use and extend existing AutoML techniques to efficiently optimize large-scale U-Net-like encoder-decoder architectures. In particular, we leverage gradient-based neural architecture se… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
73
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 73 publications
(75 citation statements)
references
References 46 publications
0
73
0
Order By: Relevance
“…Inspired from the semantic segmentation task , recent methods (Mayer et al 2016;Kendall et al 2017;Saikia et al 2019;Cheng et al 2020) adopt fully-convolutional networks to regress disparity maps, which further improves the accuracy and effi-ciency of stereo matching. Concretely, current end-to-end stereo matching networks can be roughly categorized into two types: correlation-based 2-D stereo networks and costvolume based 3-D stereo networks.…”
Section: Stereo Matchingmentioning
confidence: 99%
“…Inspired from the semantic segmentation task , recent methods (Mayer et al 2016;Kendall et al 2017;Saikia et al 2019;Cheng et al 2020) adopt fully-convolutional networks to regress disparity maps, which further improves the accuracy and effi-ciency of stereo matching. Concretely, current end-to-end stereo matching networks can be roughly categorized into two types: correlation-based 2-D stereo networks and costvolume based 3-D stereo networks.…”
Section: Stereo Matchingmentioning
confidence: 99%
“…Inspired from the semantic segmentation task [49,8], recent methods [55,39,66,13] adopt fully-convolutional networks to regress disparity maps, which further improves the accuracy and efficiency of stereo matching. Concretely, current end-to-end stereo matching networks can be roughly categorized into two types: correlation-based 2-D stereo networks and cost-volume based 3-D stereo networks.…”
Section: Stereo Matchingmentioning
confidence: 99%
“…The early studies mainly focus on optimizing the existing network architectures by enormous hands-on trial-and-error tweaking efforts. Besides, recent studies also leverage multi-task learning [33]- [35] to combine other prior vision information and NAS-based methods [11], [27] to tweak the network structure as well as the operator hyper-parameters (i.e., kernel size and channel number for the convolution layer). According to the basic operator (related to the computational efficiency) and the network pipeline, we mainly discuss two branches of network structures for disparity estimation, the ED-Conv2D series and the CVM-Conv3D series.…”
Section: Related Workmentioning
confidence: 99%
“…The architectures of DNN are very essential to achieve accurate estimation, and can be categorized into two classes, the encoder-decoder network with 2D convolution (ED-Conv2D) and the cost volume matching with 3D convolution (CVM-Conv3D). Besides, recent studies [11], [12] begin to reveal the potential of automated machine learning (AutoML) for neural architecture search (NAS) on stereo matching. In practice, to measure whether a DNN model is applicable in real-world applications, we not only need to evaluate its accuracy on unseen stereo images (whether it can estimate the disparity correctly), but also need to evaluate its time efficiency (whether it can generate the results in real-time).…”
mentioning
confidence: 99%
See 1 more Smart Citation