2020
DOI: 10.1109/access.2020.3005664
|View full text |Cite
|
Sign up to set email alerts
|

Different Contextual Window Sizes Based RNNs for Multimodal Emotion Detection in Interactive Conversations

Abstract: Multimodal emotion detection (MED) in interactive conversations is extremely important for improving the overall human-computer interaction experience. Present research methods in this domain do not explicitly distinguish the contexts of a test utterance in a meaningful way while classifying emotions in conversations. In this paper, we propose a model, named different contextual window sizes based recurrent neural networks (DCWS-RNNs), to differentiate the contexts. The model has four recurrent neural networks… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 50 publications
0
2
0
Order By: Relevance
“…Neural networks are used to filter this invalid data from videos and physiological signals by using continuity and semantic compatibility. H. Lai et al [53] proposed a model for emotion recognition in interactive conversations. In this model, different RNN architectures with variable contextual window sizes differentiate various aspects of contexts in conversations to improve the accuracy.…”
Section: Multimodal Emotion Recognition (Mmer)mentioning
confidence: 99%
“…Neural networks are used to filter this invalid data from videos and physiological signals by using continuity and semantic compatibility. H. Lai et al [53] proposed a model for emotion recognition in interactive conversations. In this model, different RNN architectures with variable contextual window sizes differentiate various aspects of contexts in conversations to improve the accuracy.…”
Section: Multimodal Emotion Recognition (Mmer)mentioning
confidence: 99%
“…Neural networks are used to filter this invalid data from videos and physiological signals by using continuity and semantic compatibility. H. Lai et al [75] proposed a model for emotion recognition in interactive conversations. In this model, different RNN architectures with variable contextual window sizes differentiate various aspects of contexts in conversations to improve the accuracy.…”
Section: Other MMDL Applicationsmentioning
confidence: 99%
“…The next generation of revolutionary HCI technology may not impact the whole industry, such as the emergence of graphical interfaces and touch technology, but may use data-driven intelligence to realize the potential revolution of HCI [8]. The vigorous development of artificial intelligence has greatly promoted the intelligence of machines, and the in-depth study of the interaction between humans and machines has promoted new gesture interaction technology and automatic speech recognition technology [9][10][11].…”
Section: Introductionmentioning
confidence: 99%