2023
DOI: 10.1109/tce.2023.3236972
|View full text |Cite
|
Sign up to set email alerts
|

Multichannel CNN-BLSTM Architecture for Speech Emotion Recognition System by Fusion of Magnitude and Phase Spectral Features Using DCCA for Consumer Applications

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 26 publications
(7 citation statements)
references
References 53 publications
0
5
0
Order By: Relevance
“…3) Several kinds of features commonly extracted [6]: 4) Zero Crossing Rate (ZCR): ZCR indicates the frequency at which a signal crosses the zero-amplitude level, characterizing changes in signal amplitude. 7) Modified Group Delay Function (MODGD) [7]: MODGD extracts phase information from sound signals by calculating the group delay function, offering insights into phase characteristics.…”
Section: Preprocessing and Feature Engineeringmentioning
confidence: 99%
See 3 more Smart Citations
“…3) Several kinds of features commonly extracted [6]: 4) Zero Crossing Rate (ZCR): ZCR indicates the frequency at which a signal crosses the zero-amplitude level, characterizing changes in signal amplitude. 7) Modified Group Delay Function (MODGD) [7]: MODGD extracts phase information from sound signals by calculating the group delay function, offering insights into phase characteristics.…”
Section: Preprocessing and Feature Engineeringmentioning
confidence: 99%
“…2D CNN, involving the transformation of audio signals into spectrogram form, may result in the loss of some temporal information. The CNN-BLSTM architecture [7], capturing local features at different scales, presents challenges due to the need for precise parameter tuning and high computational requirements.…”
Section: Feature Engineeringmentioning
confidence: 99%
See 2 more Smart Citations
“…Prabhakar, Basel, Dutta and Rao [33]. developed a multichannel Convolution Neural Network-Bidirectional Long Short Term Memory (CNN-BLSTM) architecture with an attention mechanism for speaker-independent SER by considering phase and magnitude spectrum-based features.…”
Section: Introductionmentioning
confidence: 99%