EPI-Patch Based Convolutional Neural Network for Depth Estimation on 4D Light Field

Luo, Yaoxiang; Zhou, Wenhui; Junpeng, Fang; Liang, Linkai; Zhang, Hua; Dai, Guojun

doi:10.1007/978-3-319-70090-8_65

Cited by 47 publications

(37 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Recently, deep learning methods have gained much attention in estimating depth from light fields. Heber et al [14], Luo et al [15], Heber et al [16] and Feng et al [17] feed the input of Epipolar Plane Image (EPI) to the ConvNet where the network learns the proportional relation between the slope of the epipolar line and depth. However, this relation is hard to learn in wide-baseline light fields due to the absence of the epipolar line on the EPI.…”

Section: A Deep Learning-based Methodsmentioning

confidence: 99%

“…A light field image from the camera is usually separated into the so-called subaperture images, and the baseline between sub-aperture images is very narrow. To date, traditional [2][3][4][5][6][7][8][9][10][11][12][13] and ConvNet-based [14][15][16][17][18][19][20] methods have been well studied for high performance in narrow-baseline light fields, and achieved a low percentage of errors, e.g., EPINET [19]. For wide-baseline light fields, they are usually captured by a camera array or gantry (i.e., Manuscript received 2020.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

A Lightweight Depth Estimation Network for Wide-Baseline Light Fields

Wang

Zhang

et al. 2021

IEEE Trans. on Image Process.

View full text Add to dashboard Cite

Existing traditional and ConvNet-based methods for light field depth estimation mainly work on the narrow-baseline scenario. This paper explores the feasibility and capability of ConvNets to estimate depth in another promising scenario: wide-baseline light fields. Due to the deficiency of training samples, a large-scale and diverse synthetic wide-baseline dataset with labelled data is introduced for depth prediction tasks. Considering the practical goal for real-world applications, we design an end-to-end trained lightweight convolutional network to infer depths from light fields, called LLF-Net. The proposed LLF-Net is built by incorporating a cost volume which allows variable angular light field inputs and an attention module that enables to recover details at occlusion areas. Evaluations are made on the synthetic and real-world wide-baseline light fields, and experimental results show that the proposed network achieves the best performance when compared to recent stateof-the-art methods. We also evaluate our LLF-Net on narrowbaseline datasets, and it consequently improves the performance of previous methods.

show abstract

Section: A Deep Learning-based Methodsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

A Lightweight Depth Estimation Network for Wide-Baseline Light Fields

Wang

Zhang

et al. 2021

IEEE Trans. on Image Process.

View full text Add to dashboard Cite

show abstract

“…Based on the epipolar plane image (EPI) or epipolar geometry property, [10] proposed to formulate the depth estimation as a classification problem, in which a standard CNN-architecture is employed on horizontal and vertical EPI patches. Since a shallow CNN is inadequate to guarantee the accuracy, a global optimization with traditional approach is utilized.…”

Section: Related Workmentioning

confidence: 99%

“…Light field depth estimation now turns into an active research topic with its potential to obtain more accurate depth maps [1][2][3][4][5][6][7][8][9][10][11][12][13][14][15][16], which is important to many depth-based vision applications, such as 3D reconstruction, semantic segmentation. In these vision fields, deep learning approaches typically surpass the traditional methods, both in accuracy and speed.…”

Section: Introductionmentioning

confidence: 99%

“…Moreover, a time-consuming global optimization is often adopted in order to improve depth accuracy [1, 3-6, 9, 11, 12]. Following the success of convolutional neural networks (ConvNets) in vision applications, some methods propose to adopt ConvNets to extract features for coarse estimates, followed by a global optimization [10,13] or refinement network [15] to improve the accuracy and/or speed. Recently, Epinet [14], a network trained end-to-end without postprocessing, achieves state-of-the-art accuracy onto the HCI and CVIA-HCI datasets [17,18].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Manet: Multi-Scale Aggregated Network For Light Field Depth Estimation

Zhang

Wang

et al. 2020

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

View full text Add to dashboard Cite

We present a novel end-to-end network, MANet, for light field depth estimation. MANet is a parameter-effective and efficient multi-scale aggregated network, which is about 3 times smaller and 3 times faster than the current top-performing method Epinet. The MANet architecture is performed for estimating depth from light field plenoptic cameras, and experimental results show that the proposed MANet outperforms state-of-the-art methods on HCI, CVIA-HCI and EPFL Lytro light field datasets.

show abstract