2020
DOI: 10.1109/tmm.2019.2961504
|View full text |Cite
|
Sign up to set email alerts
|

Deep Reference Generation With Multi-Domain Hierarchical Constraints for Inter Prediction

Abstract: Inter prediction is an important module in video coding for temporal redundancy removal, where similar reference blocks are searched from previously coded frames and employed to predict the block to be coded. Although traditional video codecs can estimate and compensate for block-level motions, their inter prediction performance is still heavily affected by the remaining inconsistent pixel-wise displacement caused by irregular rotation and deformation. In this paper, we address the problem by proposing a deep … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 27 publications
(4 citation statements)
references
References 46 publications
0
4
0
Order By: Relevance
“…Zhao et al 17 proposed a CNN-based enhancement model to replace the traditional frame rate up conversion (FRUC) algorithm in both frame and coding tree unit (CTU) level. Liu et al 18 proposed a multi-scale quality attentive factorized kernel convolutional neural network (MQ-FKCNN) to generate additional reference frames. Huo et al 19 have proposed a deep network-based frame extrapolation method using reference frame alignment for HEVC and VVC.…”
Section: Deep Learning-based Inter Prediction In Video Codingmentioning
confidence: 99%
“…Zhao et al 17 proposed a CNN-based enhancement model to replace the traditional frame rate up conversion (FRUC) algorithm in both frame and coding tree unit (CTU) level. Liu et al 18 proposed a multi-scale quality attentive factorized kernel convolutional neural network (MQ-FKCNN) to generate additional reference frames. Huo et al 19 have proposed a deep network-based frame extrapolation method using reference frame alignment for HEVC and VVC.…”
Section: Deep Learning-based Inter Prediction In Video Codingmentioning
confidence: 99%
“…Many previous works have showed the possibility of rate-distortion (RD) performance improvement by using deep learning in the subprocess during coding, e.g., block up-sampling [10], in-loop filtering [11], fractional interpolation [12], deep reference interpolation of B frame [13], etc. Liu et al proposed a deep frame interpolation network [14] to generate additional deep reference pictures for B frames.…”
Section: Introductionmentioning
confidence: 99%
“…In early researches [3][4][5], their models are not capable enough to consider beyond their own purposes (only for one of human vision or machine vision). Even in the recent decade, most of the enhancement methods, e.g., dehazing [6][7][8][9], deraining [10][11][12][13][14][15][16][17][18][19], illumination enhancement [20][21][22][23][24][25], image/video compression [26][27][28], compression artifact removal [29,30], and copymove forgery detection [31], only target human vision and most of image/video understanding and analytics methods, e.g. classification [32], segmentation [33], action recognition [34], and human pose estimation [35], are considered to take the clean and high-quality images as the input of a system.…”
Section: Introductionmentioning
confidence: 99%