2021
DOI: 10.1109/access.2021.3086062
|View full text |Cite
|
Sign up to set email alerts
|

An Enhanced Emotion Recognition Algorithm Using Pitch Correlogram, Deep Sparse Matrix Representation and Random Forest Classifier

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 37 publications
0
3
0
Order By: Relevance
“…English and Arabic datasets were used and processed to extract features after noise reduction. The four different datasets used were the ESD private Arabic dataset and the SUSAS, RAVDESS, and SAVEE public English datasets, and an average accuracy of more than 80% was achieved [ 13 ]. S. Hamsa et al proposed an emotionally intelligent system to identify the emotion of an unknown speaker using energy, time, and spectral features for three distinct speech datasets of two different languages.…”
Section: Related Workmentioning
confidence: 99%
“…English and Arabic datasets were used and processed to extract features after noise reduction. The four different datasets used were the ESD private Arabic dataset and the SUSAS, RAVDESS, and SAVEE public English datasets, and an average accuracy of more than 80% was achieved [ 13 ]. S. Hamsa et al proposed an emotionally intelligent system to identify the emotion of an unknown speaker using energy, time, and spectral features for three distinct speech datasets of two different languages.…”
Section: Related Workmentioning
confidence: 99%
“…The SARSA (state action reward state action) learning method is a complete state-action transition. When updating the current state action [ 18 ], this method does not use the state value function at the next moment but randomly selects actions to update the current state-action space according to a certain probability value and determines the execution action at the next moment when updating, which is the core of the SARSA method [ 19 ].The mathematical expression is described as follows: …”
Section: Methodsmentioning
confidence: 99%
“…Support vector machines are employed as a classifier in the final stage. Hamsa et al [18] proposed a speaker-independent & textindependent SER system for real applications where the speech is noisy and talking conditions like stressful. They modeled the work with the combination of the pre-processing stage as pitch-correlogram-based noise reduction, the feature representation method as sparse-dense decomposition, and Random Forest as a classifier.…”
Section: Literature Surveymentioning
confidence: 99%