2023
DOI: 10.1109/tmm.2022.3208516
|View full text |Cite
|
Sign up to set email alerts
|

Efficient VVC Intra Prediction Based on Deep Feature Fusion and Probability Estimation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(10 citation statements)
references
References 58 publications
0
10
0
Order By: Relevance
“…Deep learning is a hot method in image processing in recent years, and researchers have constructed classifiers to achieve fast CU split by deep learning. In the literature [20], a three-level CNN network with early termination is designed and a CU partition map that can represent the three-level deep CU split is constructed as the output of the network, in the literature [21], a CNN convolutional neural network with asymmetric kernel is designed, a partition rate is proposed to make the network applicable to all quantization parameters, and the threshold decision of the network output is transformed into a multi-objective optimization problem to weigh the computational complexity and algorithm performance, in the literature [22,23], a Resnet model is proposed to predict the CTU partition of HEVC standard, in the literature [24], a fast decision algorithm for intra-frame modes based on convolutional neural networks is proposed by scaling prediction units of different sizes to CU of the same size by bilinear interpolation, followed by model learning to achieve fast decision making, in the literature [25] designed a three-level MSE-CNN network, which has a structure related to the kernel size and CU size, and designed an adaptive cross-entropy activation function to solve the problem of imbalance between different splitting cases, and literature [26] used a deep convolutional network to fuse all reference features obtained from varying convolutional kernels after extracting the spatio-temporal corresponding coding features to finally determine the intra-frame coding depth. And using probabilistic model and spatio-temporal coherence based to select the candidate split mode with the best encoding depth.…”
Section: Background and Related Workmentioning
confidence: 99%
“…Deep learning is a hot method in image processing in recent years, and researchers have constructed classifiers to achieve fast CU split by deep learning. In the literature [20], a three-level CNN network with early termination is designed and a CU partition map that can represent the three-level deep CU split is constructed as the output of the network, in the literature [21], a CNN convolutional neural network with asymmetric kernel is designed, a partition rate is proposed to make the network applicable to all quantization parameters, and the threshold decision of the network output is transformed into a multi-objective optimization problem to weigh the computational complexity and algorithm performance, in the literature [22,23], a Resnet model is proposed to predict the CTU partition of HEVC standard, in the literature [24], a fast decision algorithm for intra-frame modes based on convolutional neural networks is proposed by scaling prediction units of different sizes to CU of the same size by bilinear interpolation, followed by model learning to achieve fast decision making, in the literature [25] designed a three-level MSE-CNN network, which has a structure related to the kernel size and CU size, and designed an adaptive cross-entropy activation function to solve the problem of imbalance between different splitting cases, and literature [26] used a deep convolutional network to fuse all reference features obtained from varying convolutional kernels after extracting the spatio-temporal corresponding coding features to finally determine the intra-frame coding depth. And using probabilistic model and spatio-temporal coherence based to select the candidate split mode with the best encoding depth.…”
Section: Background and Related Workmentioning
confidence: 99%
“…The fast algorithms of H.266/VVC intra coding have been explored to solve the problem of the high computational requirement of H.266/VVC. The methods can be roughly categorized into probability-based [10][11][12], learning-based [13][14][15][16][17][18][19][20], probabilityand learning-based [21,22], texture-based [23][24][25][26], gradient-based [27,28], and texture-and gradient-based [29][30][31] techniques. The related work on the fast intra coding of H.266/VVC is discussed below.…”
Section: Related Workmentioning
confidence: 99%
“…Ref. [11] employs deep convolutional networks to extract spatiotemporal coding features by integrating reference features obtained from diverse convolutional kernels. These fused features are instrumental in determining the coding depth within a given frame.…”
Section: Status Of H266/vvc Research For 2d Videomentioning
confidence: 99%