Fast CU Partition Decision Strategy Based on Human Visual System Perceptual Quality

Zhao, Jinchao; Cui, Tengyao; Zhang, Qiuwen

doi:10.1109/access.2021.3110292

Cited by 6 publications

(2 citation statements)

References 41 publications


“…Since the solution of [16] is originally designed for the complexity reduction of QT/BT partitioning scheme, to adapt to the MTT partitioning, horizontal partition modes (BTH and TTH) and vertical partition modes (BTV and TTV) are both grouped as the output of the BH/BV classifier. Zhao et al [17] design a decision tree classifier by using just noticeable difference model threshold, motion pattern and image texture features for fast CU partition decision strategy. Park and Kang [18] use a lightweight neural network model to decide whether to terminate the nested TT block structures subsequent to a quadtree based on the two kinds of features.…”

Section: Related Work (mentioning)

confidence: 99%

Efficient Partition Decision Based on Visual Perception and Machine Learning for H.266/Versatile Video Coding

et al. 2022


H.266/Versatile Video Coding (VVC) is the latest international video coding standard for encoding ultra-high-definition video efficiently. The quadtree with nested multi-type tree (QT-MTT) structure provides coding tree partitions of various sizes and allows nested binary tree (BT) and ternary tree (TT) splits at each QT level. The H.266/VVC encoder is also equipped with numerous advanced coding tools. As a result, the encoding time increases tremendously. Previous research on fast coding algorithms for H.266/VVC seldom considers perceptual redundancy. This paper uses the just noticeable difference (JND) model of human vision to extract the visually distinguishable pixels that may affect visual perception. We observe that the distributions obtained from the horizontal and vertical projections of visually distinguishable pixels within a coding unit are related to the corresponding MTT splitting modes. These distributions, which represent the perceptual information of human vision, are therefore used as the input features for machine learning. A fast MTT decision based on random forest models is proposed to quickly select the partition for intra coding. Experimental results demonstrate that the proposed method effectively accelerates the intra coding process while maintaining good bitrate and video quality, based on the properties of visual perception. The proposed algorithm outperforms the previous work.
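
The projection features described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `projection_features`, the rule that a pixel counts as visually distinguishable when its deviation from the CU mean exceeds its JND threshold, and the normalisation by CU width and height are all assumptions made for illustration.

```python
def projection_features(cu, jnd):
    """Return row and column profiles of 'visually distinguishable' pixels.

    cu  : 2-D list of luma samples for one coding unit (CU).
    jnd : 2-D list of per-pixel JND thresholds, same shape as cu.

    Assumption (not taken from the paper): a pixel is distinguishable when
    its absolute deviation from the CU mean exceeds its JND threshold.
    """
    height, width = len(cu), len(cu[0])
    mean = sum(v for row in cu for v in row) / (height * width)

    # Binary mask of pixels that exceed their JND threshold.
    visible = [
        [abs(cu[y][x] - mean) > jnd[y][x] for x in range(width)]
        for y in range(height)
    ]

    # Fraction of distinguishable pixels in each row, then in each column.
    row_profile = [sum(row) / width for row in visible]
    col_profile = [
        sum(visible[y][x] for y in range(height)) / height
        for x in range(width)
    ]

    # Concatenated profiles form the feature vector for a classifier.
    return row_profile + col_profile


# A 4x4 CU with one bright row: the row profile peaks at that row,
# while the column profile is flat.
cu = [[0] * 4, [100] * 4, [0] * 4, [0] * 4]
jnd = [[30] * 4 for _ in range(4)]
features = projection_features(cu, jnd)
```

In this toy case the distinguishable pixels concentrate in a single row, which is the kind of pattern the abstract relates to the splitting mode. The resulting vector could be passed to any classifier; the abstract names random forest models.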


“…As reported in [9], the proposed method achieved around 30.63% time saving but asking for 3.18% BDBR (Bjontegaard Bitrate) loss [10]. Afterwards, Zhao et al introduced in [11] a Decision Tree classifier based on human visual saliency to classify CU and decide how the CU is partitioned. Experimental results show that the complexity of this method is reduced by about 48.01%, while the increase of BDBR is only 0.79%.…”

Section: Introduction (mentioning)

confidence: 99%

Learning adaptive motion search for fast versatile video coding in visual surveillance systems

Thanh, Quang, Huu et al. 2023

IET Image Processing


Visual surveillance systems play an important role in monitoring and managing public areas. However, the computational complexity of video compression in these applications remains a great challenge. To meet practical requirements, the authors propose a low‐complexity surveillance video coding solution in which the most recent Versatile Video Coding (VVC) standard is improved with a novel learning adaptive motion search algorithm. The proposed algorithm is designed around the temporal motion and spatial texture characteristics of surveillance videos. First, the authors study and define a list of spatial and temporal features that indicate the motion and texture characteristics of surveillance video. These features are used together with a machine learning algorithm to assign an appropriate search range for the VVC motion search. Second, to reduce the number of search points, the authors propose an adaptive Test Zone (TZ) search in which TZ steps are terminated early according to the variation of the spatial–temporal features. Performance evaluation on a rich set of surveillance videos shows that the proposed method outperforms the state‐of‐the‐art VVC solution and relevant benchmarks, notably saving around 33% of encoding time at a negligible compression loss.
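
Encoding-time savings such as the figures quoted on this page are conventionally reported relative to an anchor encoder. The sketch below assumes that usual definition; the cited papers may average over sequences or QP values differently, and the function name `time_saving_percent` is hypothetical.

```python
def time_saving_percent(t_anchor, t_fast):
    """Relative encoding-time saving of a fast encoder, in percent.

    t_anchor : encoding time of the reference (anchor) encoder.
    t_fast   : encoding time of the accelerated encoder, same units.

    Assumes the common definition 100 * (t_anchor - t_fast) / t_anchor.
    """
    if t_anchor <= 0:
        raise ValueError("anchor encoding time must be positive")
    return 100.0 * (t_anchor - t_fast) / t_anchor


# An encoder that halves the encoding time saves 50 percent.
saving = time_saving_percent(200.0, 100.0)
```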


BLINC: lightweight bimodal learning for low-complexity VVC intra-coding

Pakdaman, Adelimanesh, Hashemi 2022

J Real-Time Image Proc
