2006
DOI: 10.1109/tasl.2006.876123
|View full text |Cite
|
Sign up to set email alerts
|

The IBM expressive text-to-speech synthesis system for American English

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
52
0
3

Year Published

2008
2008
2022
2022

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 111 publications
(55 citation statements)
references
References 6 publications
0
52
0
3
Order By: Relevance
“…The US-based ESS requires a large scale corpus of emotional speech, and emotional property of synthetic speech is colored by acoustic correlates represented by inventories in a corpus [2,[47][48][49][50][51]. The quality of ESS is dependent on the corpus and how to select inventories from the corpus.…”
Section: Corpus-based Approachmentioning
confidence: 99%
See 1 more Smart Citation
“…The US-based ESS requires a large scale corpus of emotional speech, and emotional property of synthetic speech is colored by acoustic correlates represented by inventories in a corpus [2,[47][48][49][50][51]. The quality of ESS is dependent on the corpus and how to select inventories from the corpus.…”
Section: Corpus-based Approachmentioning
confidence: 99%
“…This approach is simple but it needs large costs for constructing corpora. Pitrelli et al proposed another approach of the USbased ESS by building a corpus that mixed several emotions and by introducing the emotion as a feature for inventory selection [49]. Moriyama et al proposed an idea for representing relationship among F 0 , energy, and duration using PCA on a subspace in ESS for Japanese words [50,51].…”
Section: Corpus-based Approachmentioning
confidence: 99%
“…1 shows a set of warping functions, depending on the values of ζ. It is clear that λ ∈ [1,2] can be mapped to multiple F0 values in [f0 l , f0 h ] when ζ is altered. This forms the basis of frequency modulation in that λ and ζ can be used to represent the observed F0 contours and the adjusting proportions, respectively.…”
Section: Frequency Modulationmentioning
confidence: 99%
“…Furthermore, significant progress has been made in corpus-based unit concatenative synthesis technology [2] [3]. These two things have led to an improvement in voice quality of synthetic speech, which in turn has led to it becoming more common.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation