Asymmetric Gained Deep Image Compression With Continuous Rate Adaptation

Cui, Zhiwen; Wang, Jing; Gao, Shangyin; Guo, Ting; Feng, Yihui; Bai, Bo

doi:10.1109/cvpr46437.2021.01039

Cited by 109 publications

(58 citation statements)

References 16 publications


“…Quantization gains are derived from the multi-rate codec proposed in [18]. For each frame type f ∈ {I, P, B}, a feature-wise pair of gains (Γ enc f , Γ dec f ) is learned.…”

Section: Variable Quantization Gains (mentioning)

confidence: 99%

AIVC: Artificial Intelligence based Video Codec

Ladune, Philippe

2022

Preprint


This paper introduces AIVC, an end-to-end neural video codec. It is based on two conditional autoencoders MNet and CNet, for motion compensation and coding. AIVC learns to compress videos using any coding configurations through a single end-to-end rate-distortion optimization. Furthermore, it offers performance competitive with the recent video coder HEVC under several established test conditions. A comprehensive ablation study is performed to evaluate the benefits of the different modules composing AIVC. The implementation is made available at https://orange-opensource.github.io/AIVC/.
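The citation statement above describes a learned, feature-wise pair of gains per frame type. A minimal sketch of that idea follows, assuming the encoder gain scales each latent channel before rounding and the decoder gain rescales it afterwards. The gain values, the three-channel latent, and the function names are hypothetical and chosen only for illustration; in a trained codec the gains would be learned.

```python
from typing import Dict, List, Tuple

# Hypothetical per-channel (encoder, decoder) gains for each frame type.
# These numbers are made up; a real codec would learn them during training.
GAINS: Dict[str, Tuple[List[float], List[float]]] = {
    "I": ([4.0, 2.0, 1.0], [0.25, 0.5, 1.0]),
    "P": ([2.0, 1.0, 0.5], [0.5, 1.0, 2.0]),
    "B": ([1.0, 0.5, 0.25], [1.0, 2.0, 4.0]),
}


def quantize_with_gain(latent: List[float], frame_type: str) -> List[int]:
    """Scale each channel by its encoder gain, then round to an integer symbol."""
    gain_enc, _ = GAINS[frame_type]
    return [round(y * g) for y, g in zip(latent, gain_enc)]


def dequantize_with_gain(symbols: List[int], frame_type: str) -> List[float]:
    """Rescale each integer symbol by its decoder gain."""
    _, gain_dec = GAINS[frame_type]
    return [q * g for q, g in zip(symbols, gain_dec)]


def roundtrip(latent: List[float], frame_type: str) -> List[float]:
    """Quantize and dequantize a latent vector for the given frame type."""
    return dequantize_with_gain(quantize_with_gain(latent, frame_type), frame_type)


if __name__ == "__main__":
    latent = [0.8, -1.3, 2.6]
    for frame_type in ("I", "P", "B"):
        print(frame_type, quantize_with_gain(latent, frame_type), roundtrip(latent, frame_type))
```

In this sketch a larger encoder gain gives a finer effective quantization step for that channel, so the same latent is reconstructed more accurately at the cost of larger symbols to entropy-code.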


“…Currently, most prevailing neural image codecs follow the VAE framework [5,6]. A series of works are built upon this framework, improving from the aspects of entropy estimation [11,18,26,40], quantization [1,3,5,19,51], variable rate [12,13] and perceptual quality [4,38]. Among them, we note that the autoregressive context model [26,40] can achieve obvious rate savings but bring much more decoding complexity.…”

Section: Lossy Image Compression (mentioning)

confidence: 99%

Learning Cross-Scale Weighted Prediction for Efficient Neural Video Compression

Guo, Feng, Zhang et al.

2021

Preprint


In this paper, we present the first neural video codec that can compete with the latest coding standard H.266/VVC in terms of sRGB PSNR on UVG dataset for the low-latency mode. Existing neural hybrid video coding approaches rely on optical flow or Gaussian-scale flow for prediction, which cannot support fine-grained adaptation to diverse motion content. Towards more content-adaptive prediction, we propose a novel cross-scale prediction module that achieves more effective motion compensation. Specifically, on the one hand, we produce a reference feature pyramid as prediction sources, then transmit cross-scale flows that leverage the feature scale to control the precision of prediction. On the other hand, we introduce the mechanism of weighted prediction into the scenario of prediction with a single reference frame, where cross-scale weight maps are transmitted to synthesize a fine prediction result. In addition to the cross-scale prediction module, we further propose a multi-stage quantization strategy, which improves the rate-distortion performance with no extra computational penalty during inference. We show the encouraging performance of our efficient neural video codec (ENVC) on several common benchmark datasets and analyze in detail the effectiveness of every important component.


“…To address this limitation, advanced methods [8,28,32] propose conditional entropy models where the elements are assumed to follow conditionally independent parametric probability models, and the distribution parameters are adapted by utilizing the remaining dependencies. They can be divided into two directions: what parametric models to be used [8,16,17,32] and how to model dependencies [8,28,32,37]. The former direction includes zero-mean Gaussian [8], Gaussian [32], Gaussian mixture [16], and asymmetric Gaussian [17].…”

Section: Learned Entropy Models (mentioning)

confidence: 99%

“…They can be divided into two directions: what parametric models to be used [8,16,17,32] and how to model dependencies [8,28,32,37]. The former direction includes zero-mean Gaussian [8], Gaussian [32], Gaussian mixture [16], and asymmetric Gaussian [17]. Among them, we employ the widely used one, i.e., Gaussian [32].…”

Section: Learned Entropy Models (mentioning)

confidence: 99%

Joint Global and Local Hierarchical Priors for Learned Image Compression

Kim, Heo, Lee

2021

Preprint


Recently, learned image compression methods have shown superior performance compared to the traditional hand-crafted image codecs including BPG. One of the fundamental research directions in learned image compression is to develop entropy models that accurately estimate the probability distribution of the quantized latent representation. Like other vision tasks, most of the recent learned entropy models are based on convolutional neural networks (CNNs). However, CNNs have a limitation in modeling dependencies between distant regions due to their nature of local connectivity, which can be a significant bottleneck in image compression where reducing spatial redundancy is a key point. To address this issue, we propose a novel entropy model called Information Transformer (Informer) that exploits both local and global information in a content-dependent manner using an attention mechanism. Our experiments demonstrate that Informer improves rate-distortion performance over the state-of-the-art methods on the Kodak and Tecnick datasets without the quadratic computational complexity problem.
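The citation statements above list parametric entropy models (zero-mean Gaussian, Gaussian, Gaussian mixture, asymmetric Gaussian). As a rough illustration of the plain Gaussian case only, the sketch below estimates the bit cost of integer-quantized latents by integrating a Gaussian density over each unit-width quantization bin. The function names and the probability floor are assumptions made for this example, not taken from any of the cited papers; setting the mean to zero gives the zero-mean variant.

```python
import math
from typing import Sequence


def gaussian_cdf(x: float, mean: float, scale: float) -> float:
    """Cumulative distribution function of a Gaussian with the given mean and scale."""
    return 0.5 * (1.0 + math.erf((x - mean) / (scale * math.sqrt(2.0))))


def symbol_probability(q: int, mean: float, scale: float) -> float:
    """Probability mass of integer symbol q: the Gaussian mass on [q - 0.5, q + 0.5]."""
    return gaussian_cdf(q + 0.5, mean, scale) - gaussian_cdf(q - 0.5, mean, scale)


def estimated_bits(symbols: Sequence[int],
                   means: Sequence[float],
                   scales: Sequence[float]) -> float:
    """Sum of -log2 p(q) over all symbols, treating elements as conditionally independent."""
    total = 0.0
    for q, m, s in zip(symbols, means, scales):
        # Floor the probability (an assumption of this sketch) to avoid log2(0).
        p = max(symbol_probability(q, m, s), 1e-9)
        total += -math.log2(p)
    return total


if __name__ == "__main__":
    symbols = [0, 3, -1]
    print(estimated_bits(symbols, [0.0, 3.0, -1.0], [1.0, 1.0, 1.0]))  # well-predicted means
    print(estimated_bits(symbols, [0.0, 0.0, 0.0], [1.0, 1.0, 1.0]))   # zero-mean model
```

The two calls show why adapting the distribution parameters matters: when the predicted mean matches the symbol, the bin carries more probability mass and the estimated rate drops.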
