2024 20th CSI International Symposium on Artificial Intelligence and Signal Processing (AISP) 2024
DOI: 10.1109/aisp61396.2024.10475274
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Level Speaker-Independent Emotion Recognition Using Complex-MFCC and Swin Transformer

MohammadReza Saadati,
Rahil Mahdian Toroghi,
Hassan Zareian
Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…In contrast, complex-valued neural network (CVNN) has the ability to capture both magnitude and phase information inherent in audio signals, offering a more comprehensive representation of the underlying data. In the realm of speech processing, CVNN has demonstrated promising performance in tasks [19][20][21][22][23]. For example, in speech recognition tasks, CVNNs have shown improved robustness to background noise and reverberation, leading to higher recognition accuracy, especially in adverse acoustic conditions [24].…”
Section: Introductionmentioning
confidence: 99%
“…In contrast, complex-valued neural network (CVNN) has the ability to capture both magnitude and phase information inherent in audio signals, offering a more comprehensive representation of the underlying data. In the realm of speech processing, CVNN has demonstrated promising performance in tasks [19][20][21][22][23]. For example, in speech recognition tasks, CVNNs have shown improved robustness to background noise and reverberation, leading to higher recognition accuracy, especially in adverse acoustic conditions [24].…”
Section: Introductionmentioning
confidence: 99%