2016
DOI: 10.1007/978-3-319-31056-5_11
|View full text |Cite
|
Sign up to set email alerts
|

Analysis of Emotional Speech—A Review

Abstract: Speech carries information not only about the lexical content, but also about the age, gender, signature and emotional state of the speaker. Speech in different emotional states is accompanied by distinct changes in the production mechanism. In this chapter, we present a review of analysis methods used for emotional speech. In particular, we focus on the issues in data collection, feature representations and development of automatic emotion recognition systems. The significance of the excitation source compone… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
32
0
1

Year Published

2016
2016
2024
2024

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 54 publications
(37 citation statements)
references
References 123 publications
(157 reference statements)
0
32
0
1
Order By: Relevance
“…In future work, the long-term speech spectrum and extracted spectral parameters will be tested for their robustness to various factors affecting speech, such as emotions [7], physical fatigue [12], vocal effort [9,28], and others [4]. Furthermore, it will be useful to also investigate the influence of adverse acoustic conditions, especially non-stationary noises [29] due to their indirect influence on speech production.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In future work, the long-term speech spectrum and extracted spectral parameters will be tested for their robustness to various factors affecting speech, such as emotions [7], physical fatigue [12], vocal effort [9,28], and others [4]. Furthermore, it will be useful to also investigate the influence of adverse acoustic conditions, especially non-stationary noises [29] due to their indirect influence on speech production.…”
Section: Discussionmentioning
confidence: 99%
“…The reference values were obtained and stored in system database during training. Due to the very unbalanced numerical values of both parameter types (e.g., in the lowest subband E>10 7 and K<-1), each individual difference (x R ─ x) 2 is scaled by the denominator (x R ) 2 before summing in Equation (8). In the case of the simplest recognition based on one parameter extracted from one subband, Equation (8) can be reduced to the form…”
Section: Speaker Recognitionmentioning
confidence: 99%
“…Identification of epoch locations plays a crucial role in many speech processing applications such as speech modification [1], excitation source modeling [2], inverse filtering [3,4], joint optimization in concatenative speech synthesis [5], speech pathology [6,7], etc. Apart from above applications, the high SNR property of the GCI was used in applications like glottal activity detection [8], pitch tracking [9,10], formant frequencies [11], analysis and detection of phonation types [12,13] and emotions [14,15], speaker recognition [16], speech enhancement [8], multi-speaker separation, identification of number of speakers from multi-speakers data [17] etc. Due to wider range of applications, GCI detection has received a considerable amount of research attention.…”
Section: Introductionmentioning
confidence: 99%
“…Besides, the prosodic features conveying significant emotional information are utilized and analyzed in many previous studies [2], [8]. Pitch, as one of the prosodic features, has been found discriminative across different emotions, to some extent.…”
Section: Introductionmentioning
confidence: 99%