2020 IEEE International Conference on Image Processing (ICIP) 2020
DOI: 10.1109/icip40778.2020.9191193
|View full text |Cite
|
Sign up to set email alerts
|

Interpreting CNN For Low Complexity Learned Sub-Pixel Motion Compensation In Video Coding

Abstract: Deep learning has shown great potential in image and video compression tasks. However, it brings bit savings at the cost of significant increases in coding complexity, which limits its potential for implementation within practical applications. In this paper, a novel neural network-based tool is presented which improves the interpolation of reference samples needed for fractional precision motion compensation. Contrary to previous efforts, the proposed approach focuses on complexity reduction achieved by inter… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
14
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 13 publications
(14 citation statements)
references
References 16 publications
0
14
0
Order By: Relevance
“…Methods based on Convolutional Neural Networks (CNNs) [2], [4] provided significant improvements at the cost of two main drawbacks: the associated increase in system complexity and the tendency to disregard the location of individual reference samples. Related works deployed complex neural networks (NNs) by means of model-based interpretability [5]. For instance, VVC recently adopted simplified NN-based methods such as Matrix Intra Prediction (MIP) modes [6] and Low-Frequency Non Separable Transform (LFNST) [7].…”
Section: Introductionmentioning
confidence: 99%
“…Methods based on Convolutional Neural Networks (CNNs) [2], [4] provided significant improvements at the cost of two main drawbacks: the associated increase in system complexity and the tendency to disregard the location of individual reference samples. Related works deployed complex neural networks (NNs) by means of model-based interpretability [5]. For instance, VVC recently adopted simplified NN-based methods such as Matrix Intra Prediction (MIP) modes [6] and Low-Frequency Non Separable Transform (LFNST) [7].…”
Section: Introductionmentioning
confidence: 99%
“…Given the successful application of ML to super-resolution applications, neural networks may be employed within a video coding scheme to produce new, more efficient interpolation filters. In a previous work [8], an approach for CNN-based sub-pixel interpolation filters for fractional motion estimation was presented in the context of the VVC standard. This is the first approach that shows gains in the challenging VVC framework, while also reducing the computational requirements to an acceptable level.…”
Section: Introductionmentioning
confidence: 99%
“…This is the first approach that shows gains in the challenging VVC framework, while also reducing the computational requirements to an acceptable level. This paper expands on such work [8], uncovering new ways to boost the prediction performance of the learned interpolation, without affecting the computational requirements of the prediction process. In particular this paper introduces:…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations