2015
DOI: 10.1016/j.specom.2015.09.002
|View full text |Cite
|
Sign up to set email alerts
|

Intelligibility of time-compressed synthetic speech: Compression method and speaking style

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 27 publications
0
4
0
Order By: Relevance
“…Ninth, it is unclear how effective the spearcons would be if they were produced from text-to-speech, as many spearcons are (Walker et al, 2006). As noted in the Introduction to Experiment 1, Valentini-Botinhao et al (2015) found that compressed speech was more understandable when based on natural speech than when based on text-to-speech, especially at high compression rates. Whether that would be the same for compressed speech as short as spearcons remains to be seen.…”
Section: Relationship With Conventional Alarmsmentioning
confidence: 99%
See 1 more Smart Citation
“…Ninth, it is unclear how effective the spearcons would be if they were produced from text-to-speech, as many spearcons are (Walker et al, 2006). As noted in the Introduction to Experiment 1, Valentini-Botinhao et al (2015) found that compressed speech was more understandable when based on natural speech than when based on text-to-speech, especially at high compression rates. Whether that would be the same for compressed speech as short as spearcons remains to be seen.…”
Section: Relationship With Conventional Alarmsmentioning
confidence: 99%
“…Specifically, we used naturally spoken English for the uncompressed speech. A recent study indicated that time-compressed speech based on naturally spoken English leads to fewer comprehension errors than when based on TTS, especially at high compression rates (Valentini-Botinhao, Toman, Pucher, Schabus, & Yamagishi, 2015). We used the utterance position speed-up (UPSU) method to compress the speech, which compresses the start of the utterance slightly less than the end of the utterance to help comprehension as the words are presented (Dupoux & Green, 1997; Tucker & Whittaker, 2008).…”
Section: Experiments 1: Spearcons Versus Earcons For Single-patient M...mentioning
confidence: 99%
“…where k=1,.., Ng, Δ lu represents the quant width, whereas Ng is a number of representation levels [3], [11]. Log-uniform quantizer is designed for low and middle bit-rates (number of quantization levels (Ng) is 2, 4, 8 and 16).…”
Section: Transform Coding and Quantizers Designmentioning
confidence: 99%
“…This way, signal compression makes storing and transmission of the digital signal easier, since it requires less memory resources and narrower bandwidth for transmission while customers' experience is satisfactory [1], [2]. Although natural speech signal processing is the mostly researched, traditionally, with the growth of information technologies a lot of papers are dedicated to synthetic speech signal processing, due to its' importance in education (distance learning, foreign languages, blind individuals) and automatic recognition [3], [4].…”
Section: Introductionmentioning
confidence: 99%