Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-1647
|View full text |Cite
|
Sign up to set email alerts
|

Nonparallel Emotional Speech Conversion Using VAE-GAN

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
10
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(10 citation statements)
references
References 15 publications
0
10
0
Order By: Relevance
“…The generator learns to reconstruct the input features from the latent code z and the emotion identity, the discriminator learns to distinguish the reconstructed features with the real features. Autoencoder is easy to train owing to its simple architecture [73,50,49].…”
Section: Disentanglement Between Emotional Prosody and Linguistic Con...mentioning
confidence: 99%
See 3 more Smart Citations
“…The generator learns to reconstruct the input features from the latent code z and the emotion identity, the discriminator learns to distinguish the reconstructed features with the real features. Autoencoder is easy to train owing to its simple architecture [73,50,49].…”
Section: Disentanglement Between Emotional Prosody and Linguistic Con...mentioning
confidence: 99%
“…The generator is then conditioned on such deep emotional feature or one-hot emotion identity during the training. In this way, we learn the framelevel content-related representations from the speech in an unsupervised manner [50,60,48]. VAE-GAN [50] and VAW-GAN [25,60] are successful attempts.…”
Section: Disentanglement Between Emotional Prosody and Linguistic Con...mentioning
confidence: 99%
See 2 more Smart Citations
“…Lately, Ding et al [12] adopted vector quantized variational autoencoders (VQ-VAE) with the group latent embedding (GLE) for nonparallel data training. Moreover, to better learn the mapping function between non-parallel data distributions, cycle-consistent adversarial network (CycleGAN) [13,14] and variational autoencoder-generative adversarial network (VAE-GAN) [15] were introduced to EVC task. Furthermore, Moritani et al [16] employed starGAN to realize non-parallel spectral envelope transformation.…”
Section: Introductionmentioning
confidence: 99%