2012
DOI: 10.1016/j.dsp.2012.05.007
|View full text |Cite
|
Sign up to set email alerts
|

Speech emotion recognition: Features and classification models

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

4
78
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 208 publications
(82 citation statements)
references
References 16 publications
4
78
0
Order By: Relevance
“…• Speech Emotion Recognition: identify emotional states of human by analyzing their voice [8][9][10]. However, the main limitation of this approach is users need to speak in order to recognize their emotion.…”
Section: Emotion Recognitionmentioning
confidence: 99%
“…• Speech Emotion Recognition: identify emotional states of human by analyzing their voice [8][9][10]. However, the main limitation of this approach is users need to speak in order to recognize their emotion.…”
Section: Emotion Recognitionmentioning
confidence: 99%
“…Among all speech emotion features, prosodic feature and spectral feature are the most representative types of speech emotion features that are comprehensively employed in speech emotion recognition (Wu et al, 2011;Jin et al, 2013). The frequently adopted prosodic feature contains pitch, formants, energy, speed, and so on (Chen et al, 2012;Ayadi et al, 2011). The spectral feature is deemed to offer certain supplementary and various speech emotion features comparing with the prosodic feature and mel-frequency cepstral coefficients (MFCC), linear predictive cepstral coefficients (LPC) and log-frequency power coefficients (LFPC) are three most classic spectral features that are broadly adopted in a number of speech emotion recognition approaches (Wu et al, 2011;Ayadi et al, 2011;Jin et al, 2013).…”
Section: Introductionmentioning
confidence: 99%
“…How to pick up the most beneficial and serviceable speech emotion features which can accessibly and accurately represent the speech emotion information is the pivotal problem for speech emotion recognition (Chen et al, 2012;Jin et al, 2013;Ayadi et al, 2011). Among all speech emotion features, prosodic feature and spectral feature are the most representative types of speech emotion features that are comprehensively employed in speech emotion recognition (Wu et al, 2011;Jin et al, 2013).…”
Section: Introductionmentioning
confidence: 99%
“…In multiclass SVM classifier, for example, the structure of the classifier can be one-to-one, one-to-all, hierarchy, or tree structure, so several SVM nodes or models exist in the multiclass classifier [1][2][3]. There are two questions in speech emotion recognition (SER): (1) how to seek the optimal feature subset from the acoustic features; (2) whether the same acoustic feature subset is proper in all nodes of the multiclass classifier. These questions are researched in this paper.…”
Section: Introductionmentioning
confidence: 99%