2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00730
|View full text |Cite
|
Sign up to set email alerts
|

AANet: Attribute Attention Network for Person Re-Identifications

Abstract: This paper proposes Attribute Attention Network (AANet), a new architecture that integrates person attributes and attribute attention maps into a classification framework to solve the person re-identification (re-ID) problem. Many person re-ID models typically employ semantic cues such as body parts or human pose to improve the re-ID performance. Attribute information, however, is often not utilized. The proposed AANet leverages on a baseline model that uses body parts and integrates the key attribute informat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
164
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 342 publications
(164 citation statements)
references
References 24 publications
(48 reference statements)
0
164
0
Order By: Relevance
“…We compare the proposed method with 33 recent published works including (1) global feature based methods which aims to learn the global feature from the feature map directly, including PAN [74], DMML [7], DCDS [1], VCFL [30], MVPM [41], LRDNN [79], RB [35], LITM [63], IANet [23], Sphere [14], BNNeck [32], OSNet [78], AANet [46], DG-Net [72], BDB [12], Circle [42], SFT [31], (2) part based methods including PCB+RPP [43], Local [57], HPM [16], CASN [71], AutoReID [34], MGN [49], BHP [20] and Pyramidal [68] which utilize the semantic parts or horizontal stripes to extract part-level feature, and (3) attention based methods including MHAN [3], CAMA [58], SONA [53], CAR [80], SCAL [6], ABD-Net [8], DAAF [10] and RGA [65]. These methods are categorized into 3 types based on different backbones: the ones which employ ResNet-50 directly, the ones which modify ResNet-50 by introducing additional branches, attention subnets or dilated convolution, and the others which don't use ResNet-50.…”
Section: Comparison Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…We compare the proposed method with 33 recent published works including (1) global feature based methods which aims to learn the global feature from the feature map directly, including PAN [74], DMML [7], DCDS [1], VCFL [30], MVPM [41], LRDNN [79], RB [35], LITM [63], IANet [23], Sphere [14], BNNeck [32], OSNet [78], AANet [46], DG-Net [72], BDB [12], Circle [42], SFT [31], (2) part based methods including PCB+RPP [43], Local [57], HPM [16], CASN [71], AutoReID [34], MGN [49], BHP [20] and Pyramidal [68] which utilize the semantic parts or horizontal stripes to extract part-level feature, and (3) attention based methods including MHAN [3], CAMA [58], SONA [53], CAR [80], SCAL [6], ABD-Net [8], DAAF [10] and RGA [65]. These methods are categorized into 3 types based on different backbones: the ones which employ ResNet-50 directly, the ones which modify ResNet-50 by introducing additional branches, attention subnets or dilated convolution, and the others which don't use ResNet-50.…”
Section: Comparison Resultsmentioning
confidence: 99%
“…Holistic Features Based Methods Given a backbone C-NN such as ResNet-50 [21] or other network architectures [2,51,71,78], this type of methods learns discriminative holistic features from the feature map directly. Specifically, they aim to learn the features by improving loss functions [9,14,22,31,41,42,50,55,63], improving the training techniques [1,4,12,24,32,35,37,54], adding additional network modules [23,23,51,62], using extra semantic annotations [30,46,47,79] or generating more training samples [17,33,72,76,77]. Besides, more recent studies [3,6,8,10,27,28,38,46,48,53,58,61,64,…”
Section: Related Workmentioning
confidence: 99%
“…Semantic attributes [46,25,7] have been exploited as feature representations for person reidentification tasks. Previous work [47,6,20,42,58] leverages the attribute labels provided by original dataset to generate attribute-aware feature representation. Different from previous work, our latent part branch can attend to important visual cues without relying on detailed supervision signals from the limited predefined attributes.…”
Section: Related Workmentioning
confidence: 99%
“…Most re-ID methods are in a supervised manner, in which sufficient labeled images are given. Recently, with the developing of deep learning approaches [36,35,34], methods with convolutional neural networks have dominated the re-ID community [12,26,45,46,25,16]. Specifically, methods proposed to learn discriminative features from parts of pedestrian images achieve impressive performance [24,8,23].…”
Section: Supervised Person Re-identificationmentioning
confidence: 99%
“…Given a query image, person re-identification (re-ID) aims to match the person across multiple non-overlapped cameras. In the last few years, person re-ID has drawn increasing research attention [12,45,46,25,24,23], due to its wide range of applications such as finding people of interest (e.g., lost kids or criminals) and person tracking. However, most of the proposed methods are of supervised manner, which requires intensive manual labeling and is not applicable to real-world applications.…”
Section: Introductionmentioning
confidence: 99%