2012
DOI: 10.1007/s10772-012-9172-2
|View full text |Cite
|
Sign up to set email alerts
|

Emotion recognition from speech using global and local prosodic features

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
43
0
1

Year Published

2012
2012
2015
2015

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 150 publications
(49 citation statements)
references
References 25 publications
0
43
0
1
Order By: Relevance
“…One possible approach is to divide the speech signal into many small intervals, i.e., frames, and construct a local feature vector for each frame. For example, prosodic speech features such as pitch and energy, can be extracted from each interval and are considered as local features [26], [30]. On the other hand, global features such as statistics, can be obtained from the whole speech utterance [22], [23], [16], which typically have a lower dimension than the local ones, leading to less computing time [13], [30].…”
Section: B Feature Extractionmentioning
confidence: 99%
“…One possible approach is to divide the speech signal into many small intervals, i.e., frames, and construct a local feature vector for each frame. For example, prosodic speech features such as pitch and energy, can be extracted from each interval and are considered as local features [26], [30]. On the other hand, global features such as statistics, can be obtained from the whole speech utterance [22], [23], [16], which typically have a lower dimension than the local ones, leading to less computing time [13], [30].…”
Section: B Feature Extractionmentioning
confidence: 99%
“…Possible improvements of the proposed work would be the inclusion of other features, as those related to glottal, spectral and energy parameters, as well as a more detailed description of the temporal dynamics of F0 at sentence and word level [18].…”
Section: Discussionmentioning
confidence: 99%
“…Recently, the relevance of shape, slope and range of F0 contour in emotional speech perception, synthesis and recognition has been described [11,[16][17][18]. Moreover, local prosodic features that are related to the temporal dynamics description of prosody have been found to improve the information carried by global, static prosodic features [18].…”
Section: Introductionmentioning
confidence: 99%
“…Formants of voice are another family features [16]. Other features such as SNR, HNR, speech rate and speech quality are prosodic features [22,23,24]. MFCC, LPCC and LFPC are frequency features of voice.…”
Section: A Features Of Voicementioning
confidence: 99%