Text, Speech and Language Technology
DOI: 10.1007/978-1-4020-5817-2_2
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of Speech Synthesis

Abstract: This chapter discusses the evaluation of speech synthesis. It does not attempt to present an overview of all the techniques that may be used, or to cover the full history of previous evaluations, but instead it highlights some of the weaknesses of previous attempts, and points out areas where future development may be needed. It presents the view that speech synthesis should be judged not as a technology, but as a performance, since the actual intended listener presumably has less interest in the achievements … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 12 publications
(9 citation statements)
references
References 66 publications
0
9
0
Order By: Relevance
“…On the other hand, correct prosody and great naturalness are essential in most multimedia applications. The evaluation can be carried out at different levels (segment, word, sentence or paragraph) and with different kinds of tests (Campbell 2007).…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…On the other hand, correct prosody and great naturalness are essential in most multimedia applications. The evaluation can be carried out at different levels (segment, word, sentence or paragraph) and with different kinds of tests (Campbell 2007).…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…Although evaluations for naturalness and intelligibility had become standard by this point, a 2007 book chapter [20] predicts that tone of voice, manner of speaking, emotional expressiveness, and, generally speaking, "interpersonal skills" will become more important for speech synthesis in the future, and so we will need to find ways to evaluate these aspects as well.…”
Section: Mid-1990s and 2000s: Naturalness Intelligibility And Efforts...mentioning
confidence: 99%
“…The recommended standard maintained by the International Telecommunication Union is the mean opinion score (MOS) [64]. Raters give an objective assessment of speech quality on a 5-point scale which involves different areas of comprehension (how easy was it to understand the spoken sentence), naturalness, intelligibility (how well does a transcription by the raters match the original written sentence) and likeability [65]. Both objective as subjective measures are used and quantified, allowing for statistical comparison.This includes design of a measure that is robust to increasing complexity that can carry across different context [66].…”
Section: Recommendations For Gesture Evaluationmentioning
confidence: 99%