2021
DOI: 10.1109/jstsp.2020.3044482
|View full text |Cite
|
Sign up to set email alerts
|

Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding

Abstract: Neural networks can be successfully used to improve several modules of advanced video coding schemes. In particular, compression of colour components was shown to greatly benefit from usage of machine learning models, thanks to the design of appropriate attention-based architectures that allow the prediction to exploit specific samples in the reference region. However, such architectures tend to be complex and computationally intense, and may be difficult to deploy in a practical video coding pipeline. This wo… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
16
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 17 publications
(16 citation statements)
references
References 22 publications
0
16
0
Order By: Relevance
“…The attention-based model proposed in [2], adopts a scheme based on three network branches that are combined to produce prediction samples. The first branch (cross-component boundary branch) extracts cross component features and encodes the colours on the boundary.…”
Section: General Overviewmentioning
confidence: 99%
See 2 more Smart Citations
“…The attention-based model proposed in [2], adopts a scheme based on three network branches that are combined to produce prediction samples. The first branch (cross-component boundary branch) extracts cross component features and encodes the colours on the boundary.…”
Section: General Overviewmentioning
confidence: 99%
“…Finally, the combined features are transformed to actual colours using a third convolutional branch (the prediction head). The multi-model architecture proposed in [2], improves the baseline attention-based approach. Main improvements are associated to its block-size agnostic property as the proposed approach only requires one model for all block sizes.…”
Section: General Overviewmentioning
confidence: 99%
See 1 more Smart Citation
“…Blanch et al [9] presented a neural network architecture for cross component intra prediction, in which an attention module was employed for learning spatial relations. In [10], the authors further improved the model and reduce the complexity. Zhu et al [11] presented a new neural network architecture for chroma prediction.…”
Section: Introductionmentioning
confidence: 99%
“…Block diagram of a modern video encoder (e.g. VVC [7]), with highlighted inter-prediction modules compression tasks such as chroma intra-prediction [6], with very promising results.…”
Section: Introductionmentioning
confidence: 99%