2018 IEEE Spoken Language Technology Workshop (SLT) 2018
DOI: 10.1109/slt.2018.8639556
|View full text |Cite
|
Sign up to set email alerts
|

Comprehensive Evaluation of Statistical Speech Waveform Synthesis

Abstract: Statistical TTS systems that directly predict the speech waveform have recently reported improvements in synthesis quality. This investigation evaluates Amazon's statistical speech waveform synthesis (SSWS) system. An in-depth evaluation of SSWS is conducted across a number of domains to better understand the consistency in quality. The results of this evaluation are validated by repeating the procedure on a separate group of testers. Finally, an analysis of the nature of speech errors of SSWS compared to hybr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2

Relationship

3
4

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 25 publications
0
8
0
Order By: Relevance
“…On every test the positions of the systems on the panel were randomised. All the panels included the natural recordings as an upper anchor but similar to [20] subjects were not forced to assign the top score to any of the systems. All tests were conducted in Amazon Mechanical Turk.…”
Section: Subjective Evaluationmentioning
confidence: 99%
“…On every test the positions of the systems on the panel were randomised. All the panels included the natural recordings as an upper anchor but similar to [20] subjects were not forced to assign the top score to any of the systems. All tests were conducted in Amazon Mechanical Turk.…”
Section: Subjective Evaluationmentioning
confidence: 99%
“…Recently, many medical and healthcare devices based on deep learning technology have been proposed for tasks such as the detection of pathological voice [71], healthcare monitoring [72], heart disease prediction [73], detection and reconstruction of dysarthric speech [74], and speech waveform synthesis [75]. Through the application of deep learning with big data, we gain many benefits for healthcare applications compared with traditional approaches.…”
Section: The Existing Application Of Deep Learning Technology In Heal...mentioning
confidence: 99%
“…We conducted MUSHRA perceptual test [37]. Every listener was presented with 6 versions of a given word at the same time, 5 reconstructions and one version of recorded speech.…”
Section: Reconstruction Of Dysarthric Speechmentioning
confidence: 99%