2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00380
|View full text |Cite
|
Sign up to set email alerts
|

Omni-Scale Feature Learning for Person Re-Identification

Abstract: As an instance-level recognition problem, person reidentification (ReID) relies on discriminative features, which not only capture different spatial scales but also encapsulate an arbitrary combination of multiple scales. We callse features of both homogeneous and heterogeneous scales omni-scale features. In this paper, a novel deep ReID CNN is designed, termed Omni-Scale Network (OSNet), for omni-scale feature learning. This is achieved by designing a residual block composed of multiple convolutional feature … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
405
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 778 publications
(406 citation statements)
references
References 68 publications
(130 reference statements)
1
405
0
Order By: Relevance
“…We compare the proposed method with 33 recent published works including (1) global feature based methods which aims to learn the global feature from the feature map directly, including PAN [74], DMML [7], DCDS [1], VCFL [30], MVPM [41], LRDNN [79], RB [35], LITM [63], IANet [23], Sphere [14], BNNeck [32], OSNet [78], AANet [46], DG-Net [72], BDB [12], Circle [42], SFT [31], (2) part based methods including PCB+RPP [43], Local [57], HPM [16], CASN [71], AutoReID [34], MGN [49], BHP [20] and Pyramidal [68] which utilize the semantic parts or horizontal stripes to extract part-level feature, and (3) attention based methods including MHAN [3], CAMA [58], SONA [53], CAR [80], SCAL [6], ABD-Net [8], DAAF [10] and RGA [65]. These methods are categorized into 3 types based on different backbones: the ones which employ ResNet-50 directly, the ones which modify ResNet-50 by introducing additional branches, attention subnets or dilated convolution, and the others which don't use ResNet-50.…”
Section: Comparison Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…We compare the proposed method with 33 recent published works including (1) global feature based methods which aims to learn the global feature from the feature map directly, including PAN [74], DMML [7], DCDS [1], VCFL [30], MVPM [41], LRDNN [79], RB [35], LITM [63], IANet [23], Sphere [14], BNNeck [32], OSNet [78], AANet [46], DG-Net [72], BDB [12], Circle [42], SFT [31], (2) part based methods including PCB+RPP [43], Local [57], HPM [16], CASN [71], AutoReID [34], MGN [49], BHP [20] and Pyramidal [68] which utilize the semantic parts or horizontal stripes to extract part-level feature, and (3) attention based methods including MHAN [3], CAMA [58], SONA [53], CAR [80], SCAL [6], ABD-Net [8], DAAF [10] and RGA [65]. These methods are categorized into 3 types based on different backbones: the ones which employ ResNet-50 directly, the ones which modify ResNet-50 by introducing additional branches, attention subnets or dilated convolution, and the others which don't use ResNet-50.…”
Section: Comparison Resultsmentioning
confidence: 99%
“…We mainly review the former which utilize deep learning to extract the feature. Holistic Features Based Methods Given a backbone C-NN such as ResNet-50 [21] or other network architectures [2,51,71,78], this type of methods learns discriminative holistic features from the feature map directly. Specifically, they aim to learn the features by improving loss functions [9,14,22,31,41,42,50,55,63], improving the training techniques [1,4,12,24,32,35,37,54], adding additional network modules [23,23,51,62], using extra semantic annotations [30,46,47,79] or generating more training samples [17,33,72,76,77].…”
Section: Related Workmentioning
confidence: 99%
“…By learning jointly on global and local features, it aims to address existing drawbacks. Xie et al [51] proposed PLR-OSNet, which introduces Part-level resolution (PLR) into Omni-Scale Network (OSNet) [52]. It has two branches including both global and local feature representations.…”
Section: Background and Related Work A Person Reid Methodsmentioning
confidence: 99%
“…The heart of pedestrian tracking is consistent reidentification (ReID) of those pedestrians throughout the frames of videos across multiple cameras. Similarly, on the re-identification side, recent methods leverage CNNs to extract unique features among persons [17][18][19][20][21][22][23][24]. The work in [25] learns the spatial and temporal behavior of objects by translating the feature map of the Region of Interest (RoI) into an adaptive body-action unit.…”
Section: Related Work a Pedestrian Detection Re-identificationmentioning
confidence: 99%