Mathematical Methods in Interdisciplinary Sciences 2020
DOI: 10.1002/9781119585640.ch3
|View full text |Cite
|
Sign up to set email alerts
|

A Survey of Classification Techniques in Speech Emotion Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 85 publications
0
5
0
Order By: Relevance
“…, deep neural networks, is to employ transfer learning. Recently[ 109 ] has reviewed the application of generalizable transfer learning in AER in the existing literature. In particular, it provides an overview of the previously proposed transfer learning methods for speech-based emotion recognition by listing 21 relevant studies.…”
Section: The Data Science Approachmentioning
confidence: 99%
“…, deep neural networks, is to employ transfer learning. Recently[ 109 ] has reviewed the application of generalizable transfer learning in AER in the existing literature. In particular, it provides an overview of the previously proposed transfer learning methods for speech-based emotion recognition by listing 21 relevant studies.…”
Section: The Data Science Approachmentioning
confidence: 99%
“…The performance rate of a classifier largely depends on the type of feature extraction acquired. 35 Many hand-crafted features have been discovered by researchers to aid prediction performances. These include prosodic features (i.e., pitch, energy, zero crossing rate), 36,37 spectral features such as mel-frequency cepstral coefficients (MFCC), 38,39 gammatone frequency cepstral coefficients (GFCC), 40,41 human factor cepstral coefficients (HFCC), 41 linear predictor coefficients (LPC) and linear predictor cepstral coefficients (LPCC), 42,43 Teager-energy-operator (TEO) based features like TEO-decomposed FM variation (TEO-FM-Var), normalized TEO autocorrelation envelope area (TEO-Auto-Env) and critical bandbased TEO autocorrelation envelope area (TEO-CB-Auto-Env), 44,45 and quality features such as voice quality, tense, breathy and harsh.…”
Section: Ser Featuresmentioning
confidence: 99%
“…An emotional recognition system with respect to speech comprises of extraction of features and the prediction of emotions out of the extracted features via classification method. The performance rate of a classifier largely depends on the type of feature extraction acquired 35 . Many hand‐crafted features have been discovered by researchers to aid prediction performances.…”
Section: Related Workmentioning
confidence: 99%
“…The emotional features in speech signals can be extracted by the classification model. Speech emotional features are divided into three main categories: rhythmic features, phonological-related features, and spectral features [8]- [9].…”
Section: Introductionmentioning
confidence: 99%