2020 IEEE International Conference on Visual Communications and Image Processing (VCIP) 2020
DOI: 10.1109/vcip49819.2020.9301793
|View full text |Cite
|
Sign up to set email alerts
|

3D-CNN Autoencoder for Plenoptic Image Compression

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 10 publications
0
7
0
Order By: Relevance
“…The CNN-based angular super-resolution model of [22], which uses a set of hand-crafted features for learning the disparity, forms a part of most of these approaches. While all the above learning-based approaches have a hybrid structure with dependence on multiple components, [23] proposes a simple 3DCNN autoencoder. The 3D convolution enables learning in the pseudo-temporal dimension and hence the model has an input data structure awareness.…”
Section: Prior Workmentioning
confidence: 99%
See 2 more Smart Citations
“…The CNN-based angular super-resolution model of [22], which uses a set of hand-crafted features for learning the disparity, forms a part of most of these approaches. While all the above learning-based approaches have a hybrid structure with dependence on multiple components, [23] proposes a simple 3DCNN autoencoder. The 3D convolution enables learning in the pseudo-temporal dimension and hence the model has an input data structure awareness.…”
Section: Prior Workmentioning
confidence: 99%
“…Table II shows RGB PSNR values averaged across the 4 lenslet images available in the JPEG Pleno Light Field dataset [35]. Comparing with the only other end-to-end learning-based method [23] in terms of the reported RGB PSNR values there seems to be no advantage. However, there is a gain in terms of the reported encoding speed.…”
Section: Evaluation 1) Metricmentioning
confidence: 99%
See 1 more Smart Citation
“…Inspired by this, researchers have explored many deep learningbased video coding methods [15][16][17]. For light field image coding, some end-to-end coding frameworks have also been proposed, and preliminary results have been achieved [18][19][20][21]. Specifically, the first step involves the extraction of light field image features.…”
Section: Introductionmentioning
confidence: 99%
“…Select 80% of each class in the dataset at random as the training sample and the remainder as the test data Images and providing accuracy of 99.94%, 99.96%, and 99.99% respectively. Tingting Zhong Et.al [5] present an Adaptive Data Structure i.e a 3D convolutional (DSA-3D) auto encoder that adapts to the bone frame of pictures from a light field camera. The experimental assessment is performed using the EPFL Light Fields dataset, which comprises 118 plenoptic pictures, as well as the dataset created by our laboratory, which has 6000 plenoptic images captured by Lytro Illum.…”
Section: Introductionmentioning
confidence: 99%