2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2022
DOI: 10.1109/cvprw56347.2022.00262
|View full text |Cite
|
Sign up to set email alerts
|

Classification of Facial Expression In-the-Wild based on Ensemble of Multi-head Cross Attention Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(2 citation statements)
references
References 15 publications
0
1
0
Order By: Relevance
“…Second, time-shifting and time-scaling methods further enrich the diversity of the dataset by performing shifting and stretching operations on the time-series data to simulate fault scenarios at different time scales. In addition, noise injection techniques [38] simulate real-world environmental disturbances and noises, which helps to improve the noise immunity of the model. However inappropriate noise injection strategies may instead lead to disturbed model performance.…”
Section: Fault Recording Data Augmentationmentioning
confidence: 99%
“…Second, time-shifting and time-scaling methods further enrich the diversity of the dataset by performing shifting and stretching operations on the time-series data to simulate fault scenarios at different time scales. In addition, noise injection techniques [38] simulate real-world environmental disturbances and noises, which helps to improve the noise immunity of the model. However inappropriate noise injection strategies may instead lead to disturbed model performance.…”
Section: Fault Recording Data Augmentationmentioning
confidence: 99%
“…Zhang et al [53]utilized the multimodal information from the images, audio and text and proposed a unified multimodal framework to fully use the emotion information, which achieved the best performance in ABAW3 competition. Jeong et al [15]proposed a multi-head cross attention networks and pretrained on Glint360K [1] and some private commercial datasets. Xue et al [46] proposed the Coarse-to-Fine Cascaded networks (CFC) to address the label ambiguity problem and used smooth predicting method to post-process the extracted features.…”
Section: Affective Behavior Analysis In-the-wildmentioning
confidence: 99%