2022 Data Compression Conference (DCC) 2022
DOI: 10.1109/dcc52660.2022.00009
|View full text |Cite
|
Sign up to set email alerts
|

Beyond Keypoint Coding: Temporal Evolution Inference with Compact Feature Representation for Talking Face Video Compression

Abstract: In this paper, we propose a novel framework for Interactive Face Video Coding (IFVC), which allows humans to interact with the intrinsic visual representations instead of the signals. The proposed solution enjoys several distinct advantages, including ultra-compact representation, low delay interaction, and vivid expression and headpose animation. In particular, we propose the Internal Dimension Increase (IDI) based representation, greatly enhancing the fidelity and flexibility in rendering the appearance whil… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 18 publications
(6 citation statements)
references
References 90 publications
0
6
0
Order By: Relevance
“…• CFTE [16]: GAN-based model with deep learning based feature representation. Each reference video is compressed by these codecs to three to five compression levels.…”
Section: B Face Video Compressionmentioning
confidence: 99%
See 3 more Smart Citations
“…• CFTE [16]: GAN-based model with deep learning based feature representation. Each reference video is compressed by these codecs to three to five compression levels.…”
Section: B Face Video Compressionmentioning
confidence: 99%
“…In [47], it shows that DISTS and LPIPS deliver average-level yet the best performance on generative image compression among all IQA methods. Though the prediction accuracy is unsatisfactory, the two measures are still used to evaluate the performance of generative face video compression [16] because of the computational efficiency. Video quality assessment involves the temporal domain which significantly influences perceptual quality.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…The First Motion Order Model (FOMM) [7] proposed to describe the face based on keypoints and this framework was further investigated in [8][9]. In [10], the face evolution was extracted through the compact features. In Face-vid2vid [11], the head motion is encoded based on a novel keypoint representation where the identity-specific and motion-related information is decomposed using unsupervised method.…”
Section: Introductionmentioning
confidence: 99%