2018 IEEE Spoken Language Technology Workshop (SLT) 2018
DOI: 10.1109/slt.2018.8639535
|View full text |Cite
|
Sign up to set email alerts
|

StarGAN-VC: non-parallel many-to-many Voice Conversion Using Star Generative Adversarial Networks

Abstract: We have previously proposed a method that allows for non-parallel voice conversion (VC) by using a variant of generative adversarial networks (GANs) called StarGAN. The main features of our method, called StarGAN-VC, are as follows: First, it requires no parallel utterances, transcriptions, or time alignment procedures for speech generator training. Second, it can simultaneously learn mappings across multiple domains using a single generator network so that it can fully exploit available training data collecte… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
242
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 325 publications
(243 citation statements)
references
References 61 publications
(89 reference statements)
1
242
0
Order By: Relevance
“…We compared the proposed CC-GAN based method with two conventional approaches, i.e., the CycleGAN based VC [10] and the StarGAN based VC [18], using the VCC2018 corpus [20]. The speakers in the corpus consist of 6 males and 6 females.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…We compared the proposed CC-GAN based method with two conventional approaches, i.e., the CycleGAN based VC [10] and the StarGAN based VC [18], using the VCC2018 corpus [20]. The speakers in the corpus consist of 6 males and 6 females.…”
Section: Methodsmentioning
confidence: 99%
“…'H', 'W', 'C', 'K', and 'S' represent height, width, number of channels, kernel size, and stride, respectively. [18] for the comparison as it was impractical to train all 66 CycleGAN models due to the time and memory constraints. On the other hand, since we were not able to reproduce the same results as in [18] for the StarGAN based VC, we downloaded the converted speech samples from the author's website and used them for the comparative evaluations.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations