2019
DOI: 10.1109/access.2019.2935378
|View full text |Cite
|
Sign up to set email alerts
|

Deep Learning-Based Luma and Chroma Fractional Interpolation in Video Coding

Abstract: Motion compensated prediction is one of the essential methods to reduce temporal redundancy in inter coding. The target of motion compensated prediction is to predict the current frame from the list of reference frames. Recent video coding standards commonly use interpolation filters to obtain sub-pixel for the best matching block located in the fractional position of the reference frame. However, the fixed filters are not flexible to adapt to the variety of natural video contents. Inspired by the success of C… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 13 publications
(4 citation statements)
references
References 20 publications
0
4
0
Order By: Relevance
“…Lastly, CompetitionCNN's coding complexity for CTC and non-CTC sequences is outlined in Table IV. While prior approaches that had been proposed for HEVC required GPU coding environments [16], [17], [20], the proposed approach can be applied in conventional CPU-based codec environments, with tolerable complexity increases in both encoding and decoding time. In [8], a full three-layer network was implemented within VVC and demonstrated increases in encoding complexity by up to 380 times.…”
Section: E Compression Performance Evaluationmentioning
confidence: 99%
See 2 more Smart Citations
“…Lastly, CompetitionCNN's coding complexity for CTC and non-CTC sequences is outlined in Table IV. While prior approaches that had been proposed for HEVC required GPU coding environments [16], [17], [20], the proposed approach can be applied in conventional CPU-based codec environments, with tolerable complexity increases in both encoding and decoding time. In [8], a full three-layer network was implemented within VVC and demonstrated increases in encoding complexity by up to 380 times.…”
Section: E Compression Performance Evaluationmentioning
confidence: 99%
“…Application of linear segments has already been presented in the context of intra-prediction for video coding [22]. Furthermore, in previous work [8], a simplified CNN-based filtering process was proposed in the context of inter-prediction in VVC, as opposed to the approach in [14], [16], [20] which used HEVC as its baseline encoder. Due to the fact VVC is considerably more efficient than HEVC, it is a much more challenging baseline from which to achieve compression efficiency gains.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…An approach to using super-resolution CNNs to generate half-pixel interpolated fractional samples was introduced in [14], reporting 0.9% Bjntegaard delta-rate (BD-rate) [15] reductions under lowdelay P (LDP) configuration when replacing HEVC luma filters. Training separate networks for luma and chroma channels was presented in [16]. The resulting models were integrated within the HEVC reference software as a switchable interpolation filter, achieving 2.9% BD-rate coding gains under the LDP configuration.…”
Section: State Of the Artmentioning
confidence: 99%