2023
DOI: 10.3389/fpsyg.2022.1075624
|View full text |Cite
|
Sign up to set email alerts
|

Speech emotion recognition based on improved masking EMD and convolutional recurrent neural network

Abstract: Speech emotion recognition (SER) is the key to human-computer emotion interaction. However, the nonlinear characteristics of speech emotion are variable, complex, and subtly changing. Therefore, accurate recognition of emotions from speech remains a challenge. Empirical mode decomposition (EMD), as an effective decomposition method for nonlinear non-stationary signals, has been successfully used to analyze emotional speech signals. However, the mode mixing problem of EMD affects the performance of EMD-based me… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 53 publications
0
3
0
Order By: Relevance
“…The creators stack two parallel CNNs to speak to spatial highlights in parallel with a change encoder to speak to transient highlights, in this manner at the same time extending the channel profundity and reducing the include outline by expressive progressive highlight representation with less computational cost. To overcome the issues, in [22] proposed a modern SER system, called IMEMD-CRNN, based on the combination of an progressed adaptation of envelope signal-based EMD (IMEMD) and convolutional repetitive neural organize (CRNN). To begin with, IMEMD is proposed for discourse parsing.…”
Section: Speech Emotion Recognitionmentioning
confidence: 99%
“…The creators stack two parallel CNNs to speak to spatial highlights in parallel with a change encoder to speak to transient highlights, in this manner at the same time extending the channel profundity and reducing the include outline by expressive progressive highlight representation with less computational cost. To overcome the issues, in [22] proposed a modern SER system, called IMEMD-CRNN, based on the combination of an progressed adaptation of envelope signal-based EMD (IMEMD) and convolutional repetitive neural organize (CRNN). To begin with, IMEMD is proposed for discourse parsing.…”
Section: Speech Emotion Recognitionmentioning
confidence: 99%
“…Since 2012, the convolutional neural network has made major breakthroughs in the field of image recognition, and its recognition capabilities have surpassed humans . Therefore, the convolutional neural network is also widely used in emotion classification, , speech recognition, , agricultural engineering, , defect detection, fault diagnosis, and other fields and has made significant breakthroughs and improvements. Taking coal gangue recognition as an example, the convolutional neural network has been widely used and has achieved important research results.…”
Section: Theoretical Backgroundmentioning
confidence: 99%
“…Data are transitional electronic documents where documentary information can be stored in a medium and the corresponding electronic device identifies the physical symbol, a new form of representation of data, facts, concepts or instructions [13][14][15]. With the development of computer and network technology, data and information processing has improved efficiency and convenience [16].…”
Section: Data Processingmentioning
confidence: 99%