2010 IEEE International Conference on Acoustics, Speech and Signal Processing 2010
DOI: 10.1109/icassp.2010.5495139
|View full text |Cite
|
Sign up to set email alerts
|

Non-parallel training for many-to-many eigenvoice conversion

Abstract: This paper presents a novel training method of an eigenvoice Gaussian mixture model (EV-GMM) effectively using non-parallel data sets for many-to-many eigenvoice conversion, which is a technique for converting an arbitrary source speaker's voice into an arbitrary target speaker's voice. In the proposed method, an initial EV-GMM is trained with the conventional method using parallel data sets consisting of a single reference speaker and multiple pre-stored speakers. Then, the initial EV-GMM is further refined u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 16 publications
(7 citation statements)
references
References 12 publications
0
7
0
Order By: Relevance
“…To address this challenge, the proposed training algorithm of IC-GMR should be extended to the case of non-parallel corpus as considered in [35]. Moreover, the use of the C-GMR framework (and notably the IC-GMR approach) could be envisioned in other speech processing areas, such as silent speech interfaces [36] which are devices converting speech-related biosignals (e.g.…”
Section: Discussionmentioning
confidence: 99%
“…To address this challenge, the proposed training algorithm of IC-GMR should be extended to the case of non-parallel corpus as considered in [35]. Moreover, the use of the C-GMR framework (and notably the IC-GMR approach) could be envisioned in other speech processing areas, such as silent speech interfaces [36] which are devices converting speech-related biosignals (e.g.…”
Section: Discussionmentioning
confidence: 99%
“…VC-EVC: A many-to-many eigenvoice conversion (EVC) system [54]. The eigenvoice GMM (EV-GMM) is constructed from the training data of one pivot speaker in the ATR Japanese speech database [55], and 273 speakers (137 male, 136 female) from the JNAS database 11 .…”
Section: A Spoofing Systemsmentioning
confidence: 99%
“…VC-EVC: This is a many-to-many eigenvoice conversion (EVC) system [34]. The eigenvoice GMM (EV-GMM) was constructed from the training data from one pivot speaker in the ATR Japanese speech database [35], and 273 speakers (137 male, 136 female) from the JNAS database.…”
Section: Ss-largementioning
confidence: 99%