2021
DOI: 10.48550/arxiv.2107.03748
Preprint

Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer

Abstract: Traditional voice conversion (VC) has been focused on speaker identity conversion for speech with a neutral expression. We note that emotional expression plays an essential role in daily communication, and the emotional style of speech can be speaker-dependent. In this paper, we study the technique to jointly convert the speaker identity and speaker-dependent emotional style, which is called expressive voice conversion. We propose a StarGAN-based framework to learn a many-to-many mapping across different speaker…

Citations: cited by 1 publication (1 citation statement)
References: 62 publications
“…Despite recent progress, modeling prosody from expressive speech [10] for style transfer with voice conversion framework is still a challenging task. Besides linguistic information, transferring the source prosody to the target is vital for many voice conversion tasks, including automatic dubbing for movies in which conversations are emotional in nature.…”
Section: Introduction (mentioning)
confidence: 99%