2012
DOI: 10.1016/j.specom.2011.12.004

Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping

Abstract: , K 2012, 'Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping', Speech Communication, vol. 54, no. 6, pp. 703-714. DOI: 10.1016/j.specom.2011 …

Citations: cited by 16 publications (10 citation statements)
References: 45 publications
“…In the ML-based eigenvoice approach, given some adaptation data $\chi_a = \{x^{(1)}, x^{(2)}, \ldots, x^{(N_{o,s})}\}$, $N_{o,s}$ is the total number of observations from speaker $s$, the likelihood function…”
Section: Eigenvoice Adaptation
mentioning
confidence: 99%
“…Cross-lingual speaker adaptation (CLSA) for statistical speech synthesis is used for adapting to a target speaker in an output language, using adaptation data from the speaker in an input language. CLSA algorithms have many applications such as deployment in speech-to-speech translation systems [1,2].…”
Section: Introduction
mentioning
confidence: 99%
“…Cross-lingual speaker adaptation (CLSA) for statistical speech synthesis is a method for adapting a text-to-speech (TTS) system for a desired output language, given adaptation data (i.e., speech) from the target speaker in a different input language. Applications include speech-to-speech translation [1], [2].…”
Section: Introduction
mentioning
confidence: 99%
“…For example, one-to-many Gaussian Mixture Model (GMM)-based voice conversion can be applied to unsupervised speaker adaptation in cross-lingual speech synthesis [11], [12]. In addition, cross-lingual adaptation parameter mapping [13]-[15] and cross-lingual frame mapping [16] have also been proposed for HMM-based speech synthesis. These approaches use a non-native speaker's natural voice in his/her mother tongue to extract speaker-dependent acoustic characteristics and make it possible to synthesize naturally sounding target language voices.…”
mentioning
confidence: 99%