2022
DOI: 10.48550/arxiv.2201.03511
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A study on cross-corpus speech emotion recognition and data augmentation

Abstract: Models that can handle a wide range of speakers and acoustic conditions are essential in speech emotion recognition (SER). Often, these models tend to show mixed results when presented with speakers or acoustic conditions that were not visible during training. This paper investigates the impact of cross-corpus data complementation and data augmentation on the performance of SER models in matched (test-set from same corpus) and mismatched (test-set from different corpus) conditions. Investigations using six emo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 20 publications
(38 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?