2016
DOI: 10.1587/transinf.2016edp7231
|View full text |Cite
|
Sign up to set email alerts
|

Non-Native Text-to-Speech Preserving Speaker Individuality Based on Partial Correction of Prosodic and Phonetic Characteristics

Abstract: SUMMARYThis paper presents a novel non-native speech synthesis technique that preserves the individuality of a non-native speaker. Crosslingual speech synthesis based on voice conversion or Hidden Markov Model (HMM)-based speech synthesis is a technique to synthesize foreign language speech using a target speaker's natural speech uttered in his/her mother tongue. Although the technique holds promise to improve a wide variety of applications, it tends to cause degradation of target speaker's individuality in sy… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
1
1

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 27 publications
0
5
0
Order By: Relevance
“…On the other hand, delta feature correction brought no significant improvements and sometimes caused significant degradation of naturalness. Also, power correction, which was effective in Japanese-accented English [4], also brought no improvements. This is because power is not dominant in Chinese and Japanese speech.…”
Section: Resultsmentioning
confidence: 95%
See 3 more Smart Citations
“…On the other hand, delta feature correction brought no significant improvements and sometimes caused significant degradation of naturalness. Also, power correction, which was effective in Japanese-accented English [4], also brought no improvements. This is because power is not dominant in Chinese and Japanese speech.…”
Section: Resultsmentioning
confidence: 95%
“…Therefore, the stress and duration of Japanese-accented English speech are significantly different from those of native English speech. Correction of such features for Japanese-accented English text-to-speech synthesis can be done by partial adaptation of HSMMs [4]. First, a native English speaker's HSMMs are trained using the speaker's speech data.…”
Section: Prosody Correction For Japanese-accented Englishmentioning
confidence: 99%
See 2 more Smart Citations
“…In simple words, the goal of scenario B was to reproduce the accent of non-native speakers. This scenario is closely related to reducing accents [67], [68] or controlling accents [24] tasks.…”
Section: B Capturing Unique Speaker Characteristicsmentioning
confidence: 99%