ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9414540
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical Network Based on the Fusion of Static and Dynamic Features for Speech Emotion Recognition

Abstract: Many studies on automatic speech emotion recognition (SER) have been devoted to extracting meaningful emotional features for generating emotion-relevant representations. However, they generally ignore the complementary learning of static and dynamic features, leading to limited performances. In this paper, we propose a novel hierarchical network called HNSD that can efficiently integrate the static and dynamic features for SER. Specifically, the proposed HNSD framework consists of three different modules. To c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 30 publications
(9 citation statements)
references
References 15 publications
0
9
0
Order By: Relevance
“…Performance comparison with 5-fold leaveone-session-out [12,7,13] and 10-fold leave-one-speaker-out [14,15,16,17,18,5] cross-validation strategy on IEMOCAP.…”
Section: Co-attention-based Fusionmentioning
confidence: 99%
See 4 more Smart Citations
“…Performance comparison with 5-fold leaveone-session-out [12,7,13] and 10-fold leave-one-speaker-out [14,15,16,17,18,5] cross-validation strategy on IEMOCAP.…”
Section: Co-attention-based Fusionmentioning
confidence: 99%
“…Model WA UA CNN-ELM+STC attention [12] 61.32 60.43 Audio 25 [7] 60.64±1.96 61.32±2.26 IS09 -classification [13] 68.1 63.8 Ours 69.80 71.05 RNN(prop. )-ELM [14] 62.85 63.89 3D ACRNN [15] -64.74±5.44 BLSTM-CTC-CA [16] 69.0 67.0 CNN GRU-SeqCap [17] 72.73 59.71 CNN TF Att.pooling [18] 71.75 68.06 HNSD [5] 70.5 72.5 Ours 71.64 72.70…”
Section: Co-attention-based Fusionmentioning
confidence: 99%
See 3 more Smart Citations