2020
DOI: 10.48550/arxiv.2004.04342
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Feedback Recurrent Autoencoder for Video Compression

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
7
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 0 publications
0
7
0
Order By: Relevance
“…Moreover, there has been a surge in leveraging Variational Autoencoders for data compression [50]. The experimental results presented in Section 5.2.1 are a strong indicator that the proposed architecture can realize smaller reconstruction loss with fewer latent dimensions (100 vs 128 latent variables).…”
Section: Broader Impactmentioning
confidence: 99%
“…Moreover, there has been a surge in leveraging Variational Autoencoders for data compression [50]. The experimental results presented in Section 5.2.1 are a strong indicator that the proposed architecture can realize smaller reconstruction loss with fewer latent dimensions (100 vs 128 latent variables).…”
Section: Broader Impactmentioning
confidence: 99%
“…Another popular direction (which this work follows) is to design a low-latency ML-based codec, which only features keyframe compression and forward frame extrapolation (i.e I/P-frames only) [29,24,27]. Promising recent directions involve modeling motion using scale-space flow [2] and resolution-adaptive flow [25,16], propagating a latent state [34,13], and explicitly mitigating error propagation [28]. Yet another promising approach [14] revolves around using spatiotemporal autoencoders to encode chunks of frames.…”
Section: Ml-based Video Compressionmentioning
confidence: 99%
“…Figure 7: Rate-distortion curves of traditional codecs and state-of-the-art ML codecs[2,29,13,14,27,28,25,44,42] on the UVG and MCL-JCV video datasets.…”
mentioning
confidence: 99%
“…Vimeo-90k [11] consists of 90,000 clips of 7 frames at 448x256 resolution collected from vimeo.com, which has been used in previous works [10], [35], [58] [59] only contains human action videos, and previous methods that trained on Kinetics [32], [47], [60] generally report worse rate-distortion performance on diverse benchmarks (such as UVG, to be discussed below), compared to [4] which reportedly is trained on a significantly larger dataset with high resolution collected from youtube.com.…”
Section: Training Datasetsmentioning
confidence: 99%