2022
DOI: 10.1016/j.patrec.2022.05.006
|View full text |Cite
|
Sign up to set email alerts
|

Occlusion-aware spatial attention transformer for occluded object recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(2 citation statements)
references
References 19 publications
0
2
0
Order By: Relevance
“…Gajbhiye et al [34] combined the channel attention mechanism and convolutional neural network to generate letters for remote sensing images and conducted model evaluation in the test set, obtaining good results. Additionally, many scholars have integrated the spatial attention module into the deep learning network model to improve the training effect of the model [35][36][37]. Some geological experts focus on the spatial distribution of geological images and data and achieve good results in image modeling and image classification.…”
Section: The Mechanism Of Attention Used In Geodata Analysismentioning
confidence: 99%
“…Gajbhiye et al [34] combined the channel attention mechanism and convolutional neural network to generate letters for remote sensing images and conducted model evaluation in the test set, obtaining good results. Additionally, many scholars have integrated the spatial attention module into the deep learning network model to improve the training effect of the model [35][36][37]. Some geological experts focus on the spatial distribution of geological images and data and achieve good results in image modeling and image classification.…”
Section: The Mechanism Of Attention Used In Geodata Analysismentioning
confidence: 99%
“…Ma et al [ 17 ] proposed a robust face recognition approach based on a sparse network with limited probability, built a sparse image network with limited probability, and acquired the overall training images from a global perspective for recognition. Heo et al [ 18 ] proposed an occlusion-aware spatial attention transformer (OSAT) architecture based on a visual transformer (ViT), CutMix strengthening, and occlusion mask predictor (OMP) to solve the occlusion problem. Xu et al [ 19 ] proposed a double-active-layer-based CNN to recognize facial expressions with high accuracy by learning robust and discriminative features from data.…”
Section: Related Workmentioning
confidence: 99%