2020
DOI: 10.1007/978-3-030-51935-3_22
|View full text |Cite
|
Sign up to set email alerts
|

A Deep CNN-LSTM Framework for Fast Video Coding

Abstract: High Efficiency Video Coding (HEVC) doubles the compression rates over the previous H.264 standard for the same video quality. To improve the coding efficiency, HEVC adopts the hierarchical quadtree structured Coding Unit (CU). However, the computational complexity significantly increases due to the full search for Rate-Distortion Optimization (RDO) to find the optimal Coding Tree Unit (CTU) partition.Here, this paper proposes a deep learning model to predict the HEVC CU partition at inter-mode, instead of bru… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5

Relationship

3
2

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 14 publications
0
4
0
Order By: Relevance
“…These features would then be passed to the LSTM layers that produces an output that accounts for the entire sequence. Convolutions of 2-dimensional data are usually applied to contexts where the elements of the sequence can be represented as an image, such as traffic prediction [42] or video [43].…”
Section: ) Cnn-lstmmentioning
confidence: 99%
“…These features would then be passed to the LSTM layers that produces an output that accounts for the entire sequence. Convolutions of 2-dimensional data are usually applied to contexts where the elements of the sequence can be represented as an image, such as traffic prediction [42] or video [43].…”
Section: ) Cnn-lstmmentioning
confidence: 99%
“…On the other hand, the search of the optimal CU prediction mode can be modeled as classification problem. In this regard, researchers adopted learning-based methods in classifying CU mode decision in order to reduce the computational complexity [16][17][18][19][20][21][22]. Shen and Yu [16] proposed a CU early termination algorithm for each level of the quadtree CU partition based on weighted SVM.…”
Section: Related Work Overviewmentioning
confidence: 99%
“…Recently, deep learning (a branch of artificial intelligence) has seen great success in computer vision tasks [ 24 , 25 ], especially for video encoding [ 26 28 ]. Indeed, deep neural networks have been adopted to improve coding tools, including intra- and inter-prediction, transformation, quantization, and loop filtering for HEVC and VVC standards.…”
Section: Related Workmentioning
confidence: 99%