2006
DOI: 10.1109/tasl.2006.876113
|View full text |Cite
|
Sign up to set email alerts
|

Prosody conversion from neutral speech to emotional speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
14
0

Year Published

2007
2007
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 178 publications
(14 citation statements)
references
References 12 publications
0
14
0
Order By: Relevance
“…The most cited paper before the 2000s [ 9 ] had more than 200 citations. Between 2000 to 2010, the most cited paper [ 10 ] also had more than 200 citations. Finally, the most cited paper [ 11 ] after 2010 also had more than 200 citations, indicating an increased interest in emotional speech synthesis in recent years.…”
Section: Past Studies On Emotional Speech Synthesismentioning
confidence: 99%
See 1 more Smart Citation
“…The most cited paper before the 2000s [ 9 ] had more than 200 citations. Between 2000 to 2010, the most cited paper [ 10 ] also had more than 200 citations. Finally, the most cited paper [ 11 ] after 2010 also had more than 200 citations, indicating an increased interest in emotional speech synthesis in recent years.…”
Section: Past Studies On Emotional Speech Synthesismentioning
confidence: 99%
“…In the early 2000s, the trend shifted to parametric speech synthesis, with hidden Markov model (HMM)-based synthesis being the most popular (see rows three to eight of Table 1 ). Parametric speech synthesis increased the need for good quality databases (the term good quality here refers to recordings in recording studio environments that have controlled noise levels) with adequate phonetic coverage (between 500 [ 15 , 16 ] to 1500 [ 10 ] sentences and larger corpora with 11 h of neutral speech recording [ 15 ]). ( Neutral this context refers to speech without any emotions.)…”
Section: Past Studies On Emotional Speech Synthesismentioning
confidence: 99%
“…Items with a recognition rate of less than 60% were discarded. The limit of 60% was chosen based on Tickle's (2000) recommendation.…”
Section: Recognizable Emotional Utterancesmentioning
confidence: 99%
“…Basically, they changed prosody parameters like basic frequency (F0), value and pitch, of neutral speech to make speech emotional. [4] Murtaza Bulut models the prosody parameters of part of speech (POS) to enhance the naturalness of emotional speech. [1] Shinya Mori divides the prosody parameter space into some subspaces, and research the restriction of these subspaces to give speech emotion.…”
Section: Introductionmentioning
confidence: 99%