Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-1625
|View full text |Cite
|
Sign up to set email alerts
|

Transfer Learning for Improving Speech Emotion Classification Accuracy

Abstract: The majority of existing speech emotion recognition research focuses on automatic emotion detection using training and testing data from same corpus collected under the same conditions. The performance of such systems has been shown to drop significantly in cross-corpus and cross-language scenarios. To address the problem, this paper exploits a transfer learning technique to improve the performance of speech emotion recognition systems that is novel in cross-language and cross-corpus scenarios. Evaluations on … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
51
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
4
4
2

Relationship

2
8

Authors

Journals

citations
Cited by 108 publications
(51 citation statements)
references
References 25 publications
0
51
0
Order By: Relevance
“…In [9,10,11,12], adversarial learning paradigm * Both the authors contributed equally to this paper is explored for robust recognition. In [13,14], transfer learning approach is explored.…”
Section: Introductionmentioning
confidence: 99%
“…In [9,10,11,12], adversarial learning paradigm * Both the authors contributed equally to this paper is explored for robust recognition. In [13,14], transfer learning approach is explored.…”
Section: Introductionmentioning
confidence: 99%
“…making it difficult to avoid speaker adaptation. Fully supervised techniques trained on such datasets hence often demonstrate high accuracy for only intra-corpus data, with a natural propensity to overfit [42].…”
Section: Introductionmentioning
confidence: 99%
“…Research on speech emotion recognition primarily focuses on hand-engineered acoustic features as well as on designing e cient machine learning based models for accurate emotion prediction [6,7]. In particular, building an appropriate feature representation and designing an appropriate classi er for these features have o en been treated as separate problems in the speech recognition community.…”
Section: Introductionmentioning
confidence: 99%