2023
DOI: 10.1109/taffc.2022.3158234
|View full text |Cite
|
Sign up to set email alerts
|

Weakly-Supervised Learning for Fine-Grained Emotion Recognition Using Physiological Signals

Abstract: Instead of predicting just one emotion for one activity (e.g., video watching), fine-grained emotion recognition enables more temporally precise recognition. Previous works on fine-grained emotion recognition require segment-by-segment, fine-grained emotion labels to train the recognition algorithm. However, experiments to collect these labels are costly and time-consuming compared with only collecting one emotion label after the user watched that stimulus (i.e., the post-stimuli emotion labels). To recognize … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 10 publications
(8 citation statements)
references
References 110 publications
0
8
0
Order By: Relevance
“…The results in Figure 3 suggest that depressed patients may only display affective state associated with depressive level for a small fraction of the time. In the field of emotion recognition using physiological signals, a weakly supervised learning work [58] has also shown that subjects' emotion labels while watching videos are represented only by the most salient or recent emotion. In addition, by analyzing the attention weights between the two groups, we found that depression subjects and healthy subjects have significant differences in their responses to negative video stimulation.…”
Section: Discussionmentioning
confidence: 99%
“…The results in Figure 3 suggest that depressed patients may only display affective state associated with depressive level for a small fraction of the time. In the field of emotion recognition using physiological signals, a weakly supervised learning work [58] has also shown that subjects' emotion labels while watching videos are represented only by the most salient or recent emotion. In addition, by analyzing the attention weights between the two groups, we found that depression subjects and healthy subjects have significant differences in their responses to negative video stimulation.…”
Section: Discussionmentioning
confidence: 99%
“…Tests achieved an averaged accuracy of 76.04% (arousal), 76.62% (valence), and 57.62% (four quadrant with neutral). Additionally, Zhang et al [24] proposed a method based on deep multiple instance learning, to recognize emotions at a finer granularity level when trained with post-stimuli labels, where instances are weakly-supervised in training stage. In Elalamy et al [25] work, recurrence plots were used to obtain 2D representations of physiological activity, to get less subject dependent and better suited representation for non-stationary signals such as EDA, in conjunction with ECG and PPG.…”
Section: A Peripheral Signals-based Methodsmentioning
confidence: 99%
“…Despite the continuous nature of the emotion labels in the RECOLA and CASE datasets, some works utilize these to only predict discrete categories. Zhang et al [42] for instance, used deep multiple instance learning to classify labels in the CASE dataset into the low, neutral and high categories. Here, classification was based on features extracted from skin conductance, blood volume pulse (BVP), skin temperature and heart rate.…”
Section: A Related Work and Limitations -Machine Learning Methodsmentioning
confidence: 99%