2017 IEEE International Conference on Computer Vision (ICCV) 2017
DOI: 10.1109/iccv.2017.349
|View full text |Cite
|
Sign up to set email alerts
|

Deeply-Learned Part-Aligned Representations for Person Re-identification

Abstract: In this paper, we address the problem of person reidentification, which refers to associating the persons captured from different cameras. We propose a simple yet effective human part-aligned representation for handling the body part misalignment problem. Our approach decomposes the human body into regions (parts) which are discriminative for person matching, accordingly computes the representations over the regions, and aggregates the similarities computed between the corresponding regions of a pair of probe … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

5
487
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 770 publications
(492 citation statements)
references
References 57 publications
5
487
0
Order By: Relevance
“…To alleviate the adverse influence from the backgrounds, numerous methods [1], [2], [3], [4], [5], [6], [7], [8], [9] have been proposed. In [1], [2], [3], human landmark detectors are used to extract human keypoints and generate human part bounding boxes.…”
Section: Introductionmentioning
confidence: 99%
“…To alleviate the adverse influence from the backgrounds, numerous methods [1], [2], [3], [4], [5], [6], [7], [8], [9] have been proposed. In [1], [2], [3], human landmark detectors are used to extract human keypoints and generate human part bounding boxes.…”
Section: Introductionmentioning
confidence: 99%
“…Only K-LFDA when trained with mom LE [24] feature attains comparable performance than DMN. However, motivated to resolve the challenges for reidentification in real world (i.e., multimodal image space, and diverse impostors) IRM3 + CVI ( = 15) has much better results than MCP-CNN [39], E2E-CAN [31], Quadruplet-Net [33], and JLML [34], while our IRM3 + CVI ( = 15) has 1.49% higher rank@1 than DLPA [32]. DLPA extracts deep features by semantically aligning body parts, as well as rectifying pose variations.…”
Section: Results On Cuhk01mentioning
confidence: 99%
“…These high results demonstrate the fact that CUHK03 is the largest dataset among all and, thus, can help in learning a more discriminative DMN. Even though both JLML [34] and DLPA [32] learn deep body features with global and local body parts alignment, as well as, pose alignment, however, our IMR3 approach benefitted with transform specific metrics empowered with impostors rejection still maintained to attain better results. Our IRM3 Advances in Multimedia 9 considers optimizing all the rank orders simultaneously and, thus, have large gain at rank@5 and rank@10 in Labelled setting.…”
Section: Results On Cuhk03mentioning
confidence: 99%
“…Yao et al [26] proposed the Part Loss Networks (PL-Net) to automatically detect human parts and cross train them with the main identity task. Zhao et al [27] follows the concept of attention model and uses a part map detector to extract multiple body regions in order to compute their corresponding representations. The model is learned through triplet loss function.…”
Section: Aam Heatmapmentioning
confidence: 99%