This paper presents our Facial Action Unit (AU) recognition submission to the fifth Affective Behavior Analysis in-the-wild (ABAW) Competition. Our approach consists of three main modules: (i) a pre-trained facial representation encoder that produces a strong facial representation for each face image in the input sequence; (ii) an AU-specific feature generator that learns a set of AU-specific features from each facial representation; and (iii) a spatio-temporal graph learning module that constructs a spatio-temporal graph representation. This graph representation describes the AUs contained in all frames and predicts the occurrence of each AU based on both the spatial information modeled within the corresponding face and the temporal dynamics learned among frames. The experimental results show that our approach outperforms the baseline and that spatio-temporal graph representation learning enables our model to achieve the best results among all ablated systems. Our model ranks 4th in the AU recognition track of the 5th ABAW Competition.
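To make the three-module pipeline concrete, the following is a minimal PyTorch sketch, not the paper's implementation: the encoder is a placeholder linear layer standing in for the pre-trained facial representation encoder, per-AU linear heads stand in for the AU-specific feature generator, and a single self-attention layer over all AU nodes across all frames serves as a simple proxy for spatio-temporal graph message passing. All layer choices, dimensions, and the assumed 112x112 input size are illustrative assumptions.

```python
import torch
import torch.nn as nn


class AUGraphSketch(nn.Module):
    """Illustrative pipeline: encoder -> AU-specific features ->
    spatio-temporal reasoning over all AU nodes in the clip."""

    def __init__(self, num_aus=12, feat_dim=512, au_dim=128):
        super().__init__()
        # (i) placeholder for the pre-trained facial representation encoder
        self.encoder = nn.Linear(3 * 112 * 112, feat_dim)
        # (ii) one small head per AU yields an AU-specific feature (a graph node)
        self.au_heads = nn.ModuleList(
            [nn.Linear(feat_dim, au_dim) for _ in range(num_aus)]
        )
        # (iii) self-attention across every AU node of every frame, a stand-in
        # for message passing on a spatio-temporal graph
        self.graph = nn.MultiheadAttention(au_dim, num_heads=4, batch_first=True)
        self.classifier = nn.Linear(au_dim, 1)  # occurrence logit per AU node

    def forward(self, frames):  # frames: (B, T, 3, 112, 112)
        B, T = frames.shape[:2]
        feats = self.encoder(frames.reshape(B * T, -1))  # (B*T, feat_dim)
        # stack per-AU features: one node per AU per frame
        nodes = torch.stack([h(feats) for h in self.au_heads], dim=1)
        nodes = nodes.view(B, T * len(self.au_heads), -1)  # all nodes in the clip
        nodes, _ = self.graph(nodes, nodes, nodes)  # spatial + temporal interactions
        return self.classifier(nodes).view(B, T, -1)  # (B, T, num_aus) logits


# Usage: a batch of 2 clips, 8 frames each -> per-frame AU occurrence logits
model = AUGraphSketch()
logits = model(torch.randn(2, 8, 3, 112, 112))  # shape (2, 8, 12)
```

Attending jointly over all frames' AU nodes lets each prediction draw on both within-face spatial relations and cross-frame temporal dynamics, mirroring the role the paper assigns to its spatio-temporal graph module.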