2017
DOI: 10.1016/j.asoc.2017.03.013
|View full text |Cite
|
Sign up to set email alerts
|

Hybrid BBO_PSO and higher order spectral features for emotion and stress recognition from natural speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
2
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 41 publications
(8 citation statements)
references
References 49 publications
0
2
0
Order By: Relevance
“…The stress recognition model presented in [11] has two stage feature subset selection procedure in which a multi-cluster feature selection is employed in the first stage to identify the relevant features and to minimize the feature space. Biogeography and particle swarm optimization algorithms are incorporated in the second stage to reduce the feature dimensions and effectively differentiate different emotions.…”
Section: Machine Learning Based Emotion Classification Modelsmentioning
confidence: 99%
“…The stress recognition model presented in [11] has two stage feature subset selection procedure in which a multi-cluster feature selection is employed in the first stage to identify the relevant features and to minimize the feature space. Biogeography and particle swarm optimization algorithms are incorporated in the second stage to reduce the feature dimensions and effectively differentiate different emotions.…”
Section: Machine Learning Based Emotion Classification Modelsmentioning
confidence: 99%
“…Instead of treating each speaker and emotion separately, the model jointly optimize feature selection across multiple tasks, and it identifies the features that are relevant for emotion recognition of different speakers. Yogesh et al used high order spectral analysis (HOSA) to extract BSF and BCF from speech [22]. By combining these features with the standard 2010 interspeech features, the performance of a real-time SER system is further enhanced.…”
Section: Related Workmentioning
confidence: 99%
“…To validate the superiority of the proposed BiEO, the classification performance is compared with EO [28], BBO_PSO [22] and MOBFA [30]. EO is utilized to test whether the proposed BiEO enhances the performance of classical EO.…”
Section: B Experimental Setupmentioning
confidence: 99%
“…Later, specifically for anger emotion recognition, acoustic (pitch, loudness, spectral features) and linguistic (probabilistic and entropy-based words and phrases) cues [ 6 ] were introduced. Apart from these, other different feature extraction techniques such as a sinusoidal model-based feature extraction technique with frequency, magnitude, and phase features [ 7 ]; empirical mode decomposition method; feature optimization method [ 8 ] to select particular frames of the speech signal by choosing proper filter bank; hybrid biogeography-based optimization; and particle swarm optimization (BBO_PSO) [ 9 ] by the proper selection of higher-order spectral features were used for depressed emotion recognition. Yang and Lugger [ 10 ] proposed the combination of qualitative and voice quality features; Wen et al [ 11 ] proposed weighted spectral local Hu parameters to overcome the disadvantage of MFCC feature; Wang et al [ 12 ] proposed Fourier parameters; Setayeshi et al [ 13 ] proposed a bioinspired ANFIS technique combined with MLP for SER for anger, happy and sad emotions; and, Ying and Xue-Ying [ 14 ] proposed glottal compensation to zero crossings with maximal Teager energy operator (GCZCMT) for speech emotion recognition and performed well compared to MFCC feature.…”
Section: Introductionmentioning
confidence: 99%