Statistical voice conversion (VC) is a technique to convert specific non-or paralinguistic information while keeping linguistic information unchanged, and speaker conversion has been studied as a typical application of VC for a few decades. To better understand various VC techniques using a freely available common dataset, the Voice Conversion Challenge (VCC) was launched in 2016 and the 2nd challenge was held in 2018. As one of the baseline systems for VCC 2018, we developed open-source VC software called "sprocket", in which not only conventional techniques, such as a trajectory-based conversion method using a Gaussian mixture model (GMM) and a vocoderbased conversion framework but also recently developed techniques, such as a vocoder-free VC framework, have been implemented. Using sprocket, it is possible to 1) easily reproduce converted voices using the VCC datasets and 2) develop VC systems using other parallel speech datasets with fundamental VC functions, such as acoustic feature extraction, time alignment between the source and target features, GMM training, feature conversion, and waveform generation. In this paper, we describe 1) the technical details and use of sprocket, 2) the development of the baseline systems for the HUB and SPOKE tasks of VCC 2018 using sprocket, and 3) the performance of sprocket as a VC system by demonstrating results for our developed baseline systems in VCC 2018.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.