2022
DOI: 10.48550/arxiv.2205.15278
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model

Abstract: motion model's properties, we further propose an Implicit Emotion Displacement Learner to represent emotion-related facial dynamics as linearly additive displacements to the previously acquired motion representations.Comprehensive experiments demonstrate that by incorporating the results from both modules, our method can generate satisfactory talking face results on arbitrary subjects with realistic emotion patterns.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(13 citation statements)
references
References 46 publications
0
13
0
Order By: Relevance
“…We evaluate Diffused Heads on the most commonly used datasets for talking face generation: CREMA [2] and LRW [4]. We compare our method qualitatively and quantitatively to the current state-of-the-art in guided [15,24,45,46] and pose guidance-free [40] video synthesis. To experience the full quality of our results, readers are strongly encouraged to watch generated videos in the supplementary materials.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…We evaluate Diffused Heads on the most commonly used datasets for talking face generation: CREMA [2] and LRW [4]. We compare our method qualitatively and quantitatively to the current state-of-the-art in guided [15,24,45,46] and pose guidance-free [40] video synthesis. To experience the full quality of our results, readers are strongly encouraged to watch generated videos in the supplementary materials.…”
Section: Methodsmentioning
confidence: 99%
“…The paper [29] introduces a Portrait Image Neural Renderer (PIRenderer) that controls the face motions with the parameters of a three-dimensional morphable face model. In [15], Implicit Emotion Displacement Learner, together with Dense Warping Field, are used to obtain high-quality images.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations