2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.01031
|View full text |Cite
|
Sign up to set email alerts
|

Learning Image and Video Compression Through Spatial-Temporal Energy Compaction

Abstract: Compression has been an important research topic for many decades, to produce a significant impact on data transmission and storage. Recent advances have shown a great potential of learning image and video compression. Inspired from related works, in this paper, we present an image compression architecture using a convolutional autoencoder, and then generalize image compression to video compression, by adding an interpolation loop into both encoder and decoder sides. Our basic idea is to realize spatial-tempor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
78
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 102 publications
(78 citation statements)
references
References 26 publications
0
78
0
Order By: Relevance
“…Most recently, several end-to-end deep video compression methods have been proposed [8,7,38,9,22,13]. Specifically, Wu et al [38] proposed predicting frames by interpolation from reference frames, and the image compression network of [31] is applied to compress the residual.…”
Section: Deep Video Compressionmentioning
confidence: 99%
See 3 more Smart Citations
“…Most recently, several end-to-end deep video compression methods have been proposed [8,7,38,9,22,13]. Specifically, Wu et al [38] proposed predicting frames by interpolation from reference frames, and the image compression network of [31] is applied to compress the residual.…”
Section: Deep Video Compressionmentioning
confidence: 99%
“…In 2019, Lu et al [22] proposed the Deep Video Compression (DVC) method, in which optical flow is used to estimate the temporal motion, and two auto-encoders are employed to compress the motion and residual, respectively. Meanwhile, in [9], spatial-temporal energy compaction is added into the loss function to improve the performance of video compression. Later, Habibian et al [13] proposed the rate-distortion auto-encoder, which uses an autoregressive prior for video entropy coding.…”
Section: Deep Video Compressionmentioning
confidence: 99%
See 2 more Smart Citations
“…Such proactive, predictive perceptual capabilities enable us to take desired actions and avoid dangerous situations. To make computing machines achieve a similar level to the human perception, motion understanding and representation have been studied in many computer vision tasks such as optical flow [1]- [3], object tracking [4]- [6], action recognition [7], future frame prediction [8], video interpolation [9], and video compression [10]. However, most conventional techniques depend on temporal information from multiple consecutive frames to estimate motions.…”
Section: Introductionmentioning
confidence: 99%