2021
DOI: 10.48550/arxiv.2109.02915
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Few-shot Learning in Emotion Recognition of Spontaneous Speech Using a Siamese Neural Network with Adaptive Sample Pair Formation

Kexin Feng,
Theodora Chaspari

Abstract: Speech-based machine learning (ML) has been heralded as a promising solution for tracking prosodic and spectrotemporal patterns in real-life that are indicative of emotional changes, providing a valuable window into one's cognitive and mental state. Yet, the scarcity of labelled data in ambulatory studies prevents the reliable training of ML models, which usually rely on "data-hungry" distribution-based learning. Leveraging the abundance of labelled speech data from acted emotions, this paper proposes a few-sh… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 40 publications
0
1
0
Order By: Relevance
“…This network consists of two neural networks with the same structure [32][33][34]. Such a DPCNN architecture may be represented mathematically by the following: DPCNN can carry out single-sample learning [35]. This is symmetry.…”
Section: Decoupling Parallel Cnnmentioning
confidence: 99%
“…This network consists of two neural networks with the same structure [32][33][34]. Such a DPCNN architecture may be represented mathematically by the following: DPCNN can carry out single-sample learning [35]. This is symmetry.…”
Section: Decoupling Parallel Cnnmentioning
confidence: 99%