2021
DOI: 10.1007/s00521-021-06491-9
|View full text |Cite
|
Sign up to set email alerts
|

Deep learning-based video quality enhancement for the new versatile video coding

Abstract: Multimedia IoT (M-IoT) is an emerging type of Internet of things (IoT) relaying multimedia data (images, videos, audio and speech, etc.). The rapid growth of M-IoT devices enables the creation of a massive volume of multimedia data with different characteristics and requirements. With the development of artificial intelligence (AI), AI-based multimedia IoT systems have been recently designed and deployed for various video-based services for contemporary daily life, like video surveillance with high definition … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
10

Relationship

0
10

Authors

Journals

citations
Cited by 18 publications
(5 citation statements)
references
References 36 publications
0
5
0
Order By: Relevance
“…By presenting this algorithm, the authors enhanced the coding performance of HEVC. Soulef Bouaafia et al [31] had presented a wide-activated squeeze-and-excitation deep CNN to enhance the quality for versatile video coding. The authors had enhanced the performance in terms of RD cost.…”
Section: Related Workmentioning
confidence: 99%
“…By presenting this algorithm, the authors enhanced the coding performance of HEVC. Soulef Bouaafia et al [31] had presented a wide-activated squeeze-and-excitation deep CNN to enhance the quality for versatile video coding. The authors had enhanced the performance in terms of RD cost.…”
Section: Related Workmentioning
confidence: 99%
“…ML-based solutions are proposed for HTTP adaptive streaming [14], and a reinforcement learning framework is introduced for frame-level bit allocation in HEVC/H.265 [15]. A deep convolutional neural network (DCNN) is employed for enhancing video quality in versatile video coding (VVC) [16], and human vision models and ML are leveraged for H.266/VVC encoding [17]. Deep learning is used for video streaming over the next-generation network,…”
Section: ░ 2 Related Researchmentioning
confidence: 99%
“…Overall, the proportion of QT splitting is minimal. In summary, if QT partitioning is only predicted early, the reduction in coding complexity is limited [25]. Therefore, in this paper, the FSVM classifier and DAG-SVM classifier are used to determine whether CUs will continue to be divided, as well as the direction of division, which can effectively reduce coding complexity.…”
Section: Statistical Analysis Of Qtmt Division Structurementioning
confidence: 99%