2022
DOI: 10.48550/arxiv.2204.12022
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Estimating the Resize Parameter in End-to-end Learned Image Compression

Abstract: We describe a search-free resizing framework that can further improve the rate-distortion tradeoff of recent learned image compression models. Our approach is simple: compose a pair of differentiable downsampling/upsampling layers that sandwich a neural compression model. To determine resize factors for different inputs, we utilize another neural network jointly trained with the compression model, with the end goal of minimizing the rate-distortion objective. Our results suggest that "compression friendly" dow… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(7 citation statements)
references
References 43 publications
(54 reference statements)
0
7
0
Order By: Relevance
“…These simple approaches, however, can suffer from detail loss and artifacts, and the degraded image quality might hamper the performance of downstream visual recognition tasks, especially when the resizing factor is large. Some recent works have explored to use learning-based methods for image downscaling to enhance the desired content in the resized images from training data [3,4,20,33,45,61]. For example, the authors of [3,4] proposed a residual CNN module for downscaling, and jointly trained it with an image compression network to generate "compressionfriendly" representations.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…These simple approaches, however, can suffer from detail loss and artifacts, and the degraded image quality might hamper the performance of downstream visual recognition tasks, especially when the resizing factor is large. Some recent works have explored to use learning-based methods for image downscaling to enhance the desired content in the resized images from training data [3,4,20,33,45,61]. For example, the authors of [3,4] proposed a residual CNN module for downscaling, and jointly trained it with an image compression network to generate "compressionfriendly" representations.…”
Section: Related Workmentioning
confidence: 99%
“…2), and discuss how we employ it for training several popular vision tasks. Unlike previous proposed resizers [3,45], we aim to keep the computational cost of the model as low as possible such that it can replace existing resizers (e.g., bilinear) without extra cost, but also there is a notable performance gain. Our proposed approach is different in that (1) it is orders of magnitude faster, hence more scalable (to large image size), (2) it only has a handful of parameters which allows for better generalization, (3) it adds almost no extra training cost to the system.…”
Section: Proposed Approachmentioning
confidence: 99%
See 2 more Smart Citations
“…The authors of [8,18] studied the temporally bandpass filtered statistics of UGC and variable frame-rate videos. Deep learning has significantly advanced the state-of-the-arts on various vision applications [19][20][21][22]. Recently, several deep learning-based BVQA models have emerged [9,[23][24][25].…”
Section: Related Workmentioning
confidence: 99%