The Speaker and Language Recognition Workshop (Odyssey 2018) 2018
DOI: 10.21437/odyssey.2018-28
|View full text |Cite
|
Sign up to set email alerts
|

The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods

Abstract: We present the Voice Conversion Challenge 2018, designed as a follow up to the 2016 edition with the aim of providing a common framework for evaluating and comparing different state-of-the-art voice conversion (VC) systems. The objective of the challenge was to perform speaker conversion (i.e. transform the vocal identity) of a source speaker to a target speaker while maintaining linguistic information. As an update to the previous challenge, we considered both parallel and nonparallel data to form the Hub and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
186
0
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
4
1

Relationship

3
7

Authors

Journals

citations
Cited by 263 publications
(189 citation statements)
references
References 34 publications
2
186
0
1
Order By: Relevance
“…We compared the proposed CC-GAN based method with two conventional approaches, i.e., the CycleGAN based VC [10] and the StarGAN based VC [18], using the VCC2018 corpus [20]. The speakers in the corpus consist of 6 males and 6 females.…”
Section: Methodsmentioning
confidence: 99%
“…We compared the proposed CC-GAN based method with two conventional approaches, i.e., the CycleGAN based VC [10] and the StarGAN based VC [18], using the VCC2018 corpus [20]. The speakers in the corpus consist of 6 males and 6 females.…”
Section: Methodsmentioning
confidence: 99%
“…We used sprocket [32], an open-source implementation of GMM-based voice conversion [13]. It was used as a baseline in the 2018 Voice Conversion Challenge [36] and it thus gives us a solid anchor when comparing to other VC methods. We tuned the hyper parameters of the GMM conversion model on the internal validation set (see Section IV-A).…”
Section: B Gmm Whisper Conversionmentioning
confidence: 99%
“…In the similarity question, the participant is asked to judge the similarity between the utterance generated by one VC system and a natural utterance spoken by the target speaker in a 4-point scale. The setup is the same as in VCC2018 [38].…”
Section: Voice Conversion Challenge 2018 Spoke Taskmentioning
confidence: 99%