2023
DOI: 10.48550/arxiv.2303.09158
Preprint

Facial Affect Recognition based on Transformer Encoder and Audiovisual Fusion for the ABAW5 Challenge

Abstract: In this paper, we present our solutions for the 5th Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW), which comprises four sub-challenges: Valence-Arousal (VA) Estimation, Expression (Expr) Classification, Action Unit (AU) Detection, and Emotional Reaction Intensity (ERI) Estimation. The 5th ABAW competition focuses on facial affect recognition utilizing different modalities and datasets. In our work, we extract powerful audio and visual features using a large number of state-of-the-art (SOTA) models. T…

Citations: cited by 2 publications (4 citation statements)
References: 39 publications
“…Moreover, we compare our results with those of ME-Graph [31], and our method outperforms theirs by an average F1-score of 3.1. These results demonstrate the effectiveness of our approach in detecting AUs. 51.0; CtyunAI [60]: 48.9; HSE-NN-SberAI [39]: 48.8; USTC-AC [51]: 48.1; HFUT-MAC [59]: 47.5; SCLAB-CNU [35]: 45.6; USC-IHP [53]: 42.9; Baseline [20]: 36.5…”
Section: Results On Validation Set
Citation type: mentioning
confidence: 99%
“…Achieving a 53.07% F1-score, while not considered high, is still acceptable and reasonable in this type of application [44, 46–50]. The complexity and inherent ambiguity of emotion recognition, coupled with the dataset's representation, make it challenging to achieve notable performance.…”
Section: Discussion
Citation type: mentioning
confidence: 99%
“…F1-Score. Yu et al. [46]: 0.3075; Xue et al. [47]: 0.3218; Savchenko [48]: 0.3292; Zhang et al. [49]: 0.3337; Zhou et al. [50]: 0.3532; Proposed approach: 0.5307. * Value in bold represents the best performance.…”
Section: Methods
Citation type: mentioning
confidence: 99%