2019
DOI: 10.1109/tmm.2019.2920603
Progressive Spatial Recurrent Neural Network for Intra Prediction

Abstract: Intra prediction is an important component of modern video codecs, able to efficiently squeeze out spatial redundancy in video frames. With preceding pixels as the context, traditional intra prediction schemes generate linear predictions along several predefined directions (i.e., modes) for the blocks to be encoded. However, these modes are relatively simple, and their predictions may fail on blocks with complex textures, which costs additional bits to encode the residue. In this paper, we…
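The traditional scheme the abstract contrasts against can be illustrated with a minimal sketch. The function below is a simplified, hypothetical interface (real codecs such as HEVC use 35 modes with sub-pixel interpolation and reference-sample filtering); it shows only how a few basic modes extrapolate the boundary context linearly into the block:

```python
import numpy as np

def intra_predict(left, top, mode):
    """Minimal sketch of traditional directional intra prediction.

    `left` and `top` hold the reconstructed boundary pixels of an NxN
    block (simplified interface for illustration only).
    """
    n = len(top)
    if mode == "dc":            # average of the boundary context
        pred = np.full((n, n), (left.mean() + top.mean()) / 2)
    elif mode == "vertical":    # copy the top row downwards
        pred = np.tile(top, (n, 1))
    elif mode == "horizontal":  # copy the left column rightwards
        pred = np.tile(left[:, None], (1, n))
    else:
        raise ValueError(f"unknown mode: {mode}")
    return pred
```

Because each mode is a fixed linear copy of the boundary, it cannot track complex textures inside the block; the residue then carries the cost, which is the gap learned predictors aim to close.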

Cited by 63 publications (25 citation statements)
References 51 publications
“…One would intuitively expect that coding performance can be further improved if better predictions can be produced. Therefore, there have been a number of attempts to leverage the powerful capacity of stacked DNNs for better intrapredictor generation, including the CNN-based predictor refinement suggested in [113] to reduce prediction residual, additional learned mode trained using FCN models reported in [114] and [115], using RNNs in [116], using CNNs in [108], even using GANs in [117], and so on. These approaches have actively utilized the neighbor pixels or blocks and/or other context information (e.g., mode) if applicable, in order to accurately represent the local structures for better prediction.…”
Section: A Modularized Neural Video Codingmentioning
confidence: 99%
“…In [16,17], fully connected neural networks were used to model the relation between the boundary reconstructed pixels and the original pixels of the current block, achieving roughly 2%-4% bit-rate savings. The researchers in [18] applied a recurrent neural network to progressively generate the prediction signal of the current block, noticeably improving coding efficiency. At almost the same time, attention mechanisms became a hotspot in the literature [19][20][21][22].…”
Section: Introductionmentioning
confidence: 99%
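The fully-connected approach described in the statement above can be sketched abstractly: the flattened boundary context is mapped through a small MLP to the full block of predicted pixels. The dimensions and weights here are illustrative placeholders, not the trained networks of [16,17]; in practice the weights are learned to minimize the residual against the original pixels:

```python
import numpy as np

def fc_intra_predict(context, w1, b1, w2, b2, n=8):
    """Hedged sketch of an FC-network intra predictor: boundary context
    in, NxN block of predicted pixels out. Weights are placeholders."""
    h = np.maximum(0.0, context @ w1 + b1)  # ReLU hidden layer
    out = h @ w2 + b2                       # one value per block pixel
    return out.reshape(n, n)

# Toy dimensions: 17 context pixels -> 64 predicted pixels (8x8 block).
rng = np.random.default_rng(0)
w1 = rng.standard_normal((17, 32)) * 0.1
b1 = np.zeros(32)
w2 = rng.standard_normal((32, 64)) * 0.1
b2 = np.zeros(64)
pred = fc_intra_predict(rng.standard_normal(17), w1, b1, w2, b2)
```

Unlike the fixed directional modes, such a mapping can express nonlinear combinations of the context pixels, which is where the reported 2%-4% savings come from.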
“…Dumas et al [5] stated that a convolutional neural network (CNN) performs better than FC layers for blocks larger than 8×8. Hu et al [6] presented a new structure based on a recurrent neural network (RNN). Sun et al [7] studied different combination schemes of traditional modes (TM) and neural network modes (NM) for a fixed 8×8 block size.…”
Section: Introductionmentioning
confidence: 99%
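The recurrent structure mentioned for [6] can be loosely sketched as generating the block progressively, one row at a time, with a hidden state that carries previously generated content forward. This is a schematic illustration only, with untrained placeholder weights, not the architecture of the cited paper:

```python
import numpy as np

def progressive_rnn_predict(top, left, Wx, Wh, Wo, n=4):
    """Loose sketch of progressive spatial recurrence: initialize the
    hidden state from the boundary context, then emit the block row by
    row, updating the state at each step."""
    h = np.tanh(np.concatenate([top, left]) @ Wx)  # init from context
    rows = []
    for _ in range(n):
        h = np.tanh(h @ Wh)   # recurrent state update
        rows.append(h @ Wo)   # emit one row of n pixels
    return np.stack(rows)

# Toy dimensions: 4+4 boundary pixels, 16-d hidden state, 4x4 block.
rng = np.random.default_rng(1)
Wx = rng.standard_normal((8, 16)) * 0.1
Wh = rng.standard_normal((16, 16)) * 0.1
Wo = rng.standard_normal((16, 4)) * 0.1
block = progressive_rnn_predict(np.ones(4), np.ones(4), Wx, Wh, Wo)
```

The progressive ordering lets later rows condition on earlier predicted content rather than only on the original boundary, which is the property that distinguishes RNN predictors from one-shot FC or CNN mappings.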
“…The first is that TMs still remain in the coding framework. In [4], [5] and [6], one or two NMs were provided. In [7], at most seven NMs were exploited.…”
Section: Introductionmentioning
confidence: 99%