2021 12th International Symposium on Chinese Spoken Language Processing (ISCSLP) 2021
DOI: 10.1109/iscslp49672.2021.9362058
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Cross-Lingual Speech Emotion Recognition Using Domain Adversarial Neural Network

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 26 publications
0
4
0
Order By: Relevance
“…They evaluate the proposed architecture against cross-corpus settings and achieve improved results compared to AE-based models and Danns. In [165], the authors evaluate a DANN against cross-lingual SER. They use GRL with a language classifier, which helps the model to learn language-independent emotional representations.…”
Section: Deep Domain Adaptive Representation Learningmentioning
confidence: 99%
“…They evaluate the proposed architecture against cross-corpus settings and achieve improved results compared to AE-based models and Danns. In [165], the authors evaluate a DANN against cross-lingual SER. They use GRL with a language classifier, which helps the model to learn language-independent emotional representations.…”
Section: Deep Domain Adaptive Representation Learningmentioning
confidence: 99%
“…It is designed to address challenges related to training deep neural networks by normalizing activations within smaller groups instead of the entire batch. GN does not impact cross-domain transfer learning and can save memory [15]. The mathematical formulation of Group Normalization is given by:…”
Section: Self-normalization Network Blockmentioning
confidence: 99%
“…Previous studies have identified several popular feature extraction techniques for SER, including low-level descriptors (LLDs), the mel spectrogram, the wav2vec representation [7], and feature selection based on genetic algorithms [8][9][10][11]. LLDs have been used in studies [12][13][14], while the mel spectrogram has been used in studies such as [15][16][17][18][19]. Wav2vec, on the other hand, has been used in studies [20][21][22][23][24][25].…”
Section: Introductionmentioning
confidence: 99%
“…The third observation is that no studies have been conducted on cross-lingual SER using S3R features to date. The features commonly used in the community, such as the mel spectrogram [16], usually contain some unhelpful information for SER, such as language information. In contrast, language information may even degrade performance.…”
Section: Introductionmentioning
confidence: 99%