2017
DOI: 10.1007/978-3-319-64206-2_36
|View full text |Cite
|
Sign up to set email alerts
|

Last Syllable Unit Penalization in Unit Selection TTS

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 29 publications
0
5
0
Order By: Relevance
“…The final comparison of the evaluation experiment using sentences of the speech corpus SC1 with the results obtained by the standard listening test method described in more detail in [11] shows principal correspondence as documented by the graphs in Figure 11. While the results for the M1, F1, and M2 voices are stable and prefer the TTS 2 method for both databases, for the F2 voice the results are classified as similar in the TTS 1 as well as the TTS 2 .…”
Section: Discussion Of the Obtained Resultsmentioning
confidence: 63%
See 4 more Smart Citations
“…The final comparison of the evaluation experiment using sentences of the speech corpus SC1 with the results obtained by the standard listening test method described in more detail in [11] shows principal correspondence as documented by the graphs in Figure 11. While the results for the M1, F1, and M2 voices are stable and prefer the TTS 2 method for both databases, for the F2 voice the results are classified as similar in the TTS 1 as well as the TTS 2 .…”
Section: Discussion Of the Obtained Resultsmentioning
confidence: 63%
“…To evaluate synthetic speech quality by continual classification in the P-A scale, we collected the first speech corpus (SC1) consisting of three parts: the original speech uttered by real speakers, and two variations of speech synthesis produced by the Czech TTS system using the USEL method [16] with voices based on the original speaker. Two methods of prosody manipulation were applied: the rule-based method (assigned as TTS A ) and the modified version reflecting the final syllable status (as TTS B ) [11]. The natural as well as the synthetic speech originates from four professional speakers-two males (M1, M2) and two females (F1, F2).…”
Section: Materials Used Initial Settings and Conditionsmentioning
confidence: 99%
See 3 more Smart Citations