2019
DOI: 10.1109/tcsvt.2018.2878952
|View full text |Cite
|
Sign up to set email alerts
|

Video Compression Based on Spatio-Temporal Resolution Adaptation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
45
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 68 publications
(46 citation statements)
references
References 20 publications
1
45
0
Order By: Relevance
“…It however remains an underdeveloped research area, with most existing work focused on the improvement of conventional coding tools, such as intra prediction [10,11], inter prediction [12,13] and inloop filters [14,15]. Some work has been reported on the use of deep learning based resolution adaptation methods [6,16,17], which have demonstrated resolution adaptation across a wider bitrate range and offer significant coding gains.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…It however remains an underdeveloped research area, with most existing work focused on the improvement of conventional coding tools, such as intra prediction [10,11], inter prediction [12,13] and inloop filters [14,15]. Some work has been reported on the use of deep learning based resolution adaptation methods [6,16,17], which have demonstrated resolution adaptation across a wider bitrate range and offer significant coding gains.…”
Section: Introductionmentioning
confidence: 99%
“…Inspired by our previous work [16], a novel Convolutional Neural Network (CNN) based effective bit depth adaptation approach (EBDA-CNN) is proposed for video compression, which reduces the effective bit depth of an input video before encoding, and reconstructs the original bit depth at the decoder. The data precision and internal bit depth during encoding and decoding remain the same.…”
Section: Introductionmentioning
confidence: 99%
“…This shift has been largely fuelled by the success of deep CNN architectures for single image super-resolution, that have set the state-of-the-art, with recent architectures like VDSR [15], EDSR [16], FSRCNN [17], DRCN [18] and DBPN [4] achieving several dB higher PSNR in the luminance channel of standard image benchmarks for lossless image upscaling. Thus, Afonso et al propose a spatio-temporal resolution adaptation where a CNN-based super-resolution model is used to reconstruct full-resolution content [19]. Li et al [20] introduce the block adaptive resolution coding framework for intra frame coding, where each block within a frame is either downscaled or coded at original resolution and then upscaled with a trained CNN at the decoder side.…”
Section: Related Workmentioning
confidence: 99%
“…Despite the current trend of increasing the capture and display devices spatial resolutions further and further, such resolutions are not necessarily required to represent the critical information of the signal. A well-known strategy consists in downsampling the video prior to encoding and upsampling back to the original resolution after decoding, in an effort to reduce the bitrate and/or the coding complexity for a similar reconstructed output quality [7][8][9][10][11][12][13][14][15]. Several studies have considered applying the down/up-sampling at an image level, showing that coding at a lower resolution performs better, especially at low bitrate [7,8].…”
Section: Introductionmentioning
confidence: 99%
“…Based on the recent advances in the active field of single image Super Resolution (SR) [16,17], several studies have used SR upsampling for ASR-based coding showing increased gains compared to common upsampling filter-banks [11][12][13][14][15]. However, these gains come at the expense of a significant increase in processing complexity, making SR unsuitable for low-complexity encoding/decoding.…”
Section: Introductionmentioning
confidence: 99%