Anais Do XX Encontro Nacional De Inteligência Artificial E Computacional (ENIAC 2023) 2023
DOI: 10.5753/eniac.2023.234481
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal Audio Emotion Recognition with Graph-based Consensus Pseudolabeling

Gabriel Natal Coutinho,
Artur de Vlieger Lima,
Juliano Yugoshi
et al.

Abstract: This paper presents a novel method called Multimodal Graph-based Consensus Pseudolabeling (MGCP) for unsupervised emotion recognition in audio. The goal is to determine the emotion of audio segments using the circumplex model of emotions. The method combines pre-trained unimodal models for audio and text and follows a three-step process. First, audio segments are represented using embeddings from unimodal models. Then, modality-specific graphs are constructed based on similarity and integrated into a multimoda… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 23 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?