2024
DOI: 10.3390/app14041535
|View full text |Cite
|
Sign up to set email alerts
|

Context Transformer and Adaptive Method with Visual Transformer for Robust Facial Expression Recognition

Lingxin Xiong,
Jicun Zhang,
Xiaojia Zheng
et al.

Abstract: In real-world scenarios, the facial expression recognition task faces several challenges, including lighting variations, image noise, face occlusion, and other factors, which limit the performance of existing models in dealing with complex situations. To cope with these problems, we introduce the CoT module between the CNN and ViT frameworks, which improves the ability to perceive subtle differences by learning the correlations between local area features at a fine-grained level, helping to maintain the consis… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 54 publications
0
2
0
Order By: Relevance
“…With the advancement in computer vision and image recognition technology, image recognition based on deep learning has been applied in various detection fields, such as face recognition [7], transmission line icing detection [8], and biomedical image analysis [9,10]. Ref.…”
Section: Introductionmentioning
confidence: 99%
“…With the advancement in computer vision and image recognition technology, image recognition based on deep learning has been applied in various detection fields, such as face recognition [7], transmission line icing detection [8], and biomedical image analysis [9,10]. Ref.…”
Section: Introductionmentioning
confidence: 99%
“…Chen et al [32] devised a face attention unit to capture facial structure information, while Lu et al [33] designed an external-internal split attention group to reconstruct clear facial images. Furthermore, the performance of transformers has already been proven and widely applied in computer vision tasks, such as image recognition [34,35] and image restoration [28,36,37]. The core of the transformers is a self-attention mechanism that can capture both long-and short-range correlations between words/pixels [38].…”
Section: Introductionmentioning
confidence: 99%