ALFormer: Attribute localization transformer in pedestrian attribute recognition
Yuxin Liu,
Mingzhe Wang,
Chao Li
et al.
Abstract:Pedestrian attribute recognition is an important task for intelligent video
surveillance. However, existing methods struggle to accurately localize
discriminative regions for each attribute. We propose Attribute Localization
Transformer (ALFormer), a novel framework to improve spatial localization
through two key components. First, we introduce Mask Contrast Learning
(MCL) to suppress regional feature relevance, forcing the model to focus on
intrinsic spatial areas for each attribute. Secon… Show more
Set email alert for when this publication receives citations?
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.