2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00076
|View full text |Cite
|
Sign up to set email alerts
|

Densely Semantically Aligned Person Re-Identification

Abstract: We propose a densely semantically aligned person reidentification framework. It fundamentally addresses the body misalignment problem caused by pose/viewpoint variations, imperfect person detection, occlusion, etc. By leveraging the estimation of the dense semantics of a person image, we construct a set of densely semantically aligned part images (DSAP-images), where the same spatial positions have the same semantics across different images. We design a two-stream network that consists of a main full image str… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
193
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 286 publications
(194 citation statements)
references
References 63 publications
1
193
0
Order By: Relevance
“…The softmax output is supervised by the ID label of the training images through the cross-entropy loss. Employing extra crossentropy loss slightly improves the re-ID accuracy of both VANet and the baseline, which is consistent with [1,35].…”
Section: Datasets and Settingssupporting
confidence: 78%
See 1 more Smart Citation
“…The softmax output is supervised by the ID label of the training images through the cross-entropy loss. Employing extra crossentropy loss slightly improves the re-ID accuracy of both VANet and the baseline, which is consistent with [1,35].…”
Section: Datasets and Settingssupporting
confidence: 78%
“…In addition to the triplet loss, we adopt a cross-entropy loss, following several recent re-ID methods [1,35]. Specifically, we append an ID-classifier upon the featureembedding layer.…”
Section: Datasets and Settingsmentioning
confidence: 99%
“…[42,43,27] devise attention mechanisms to focus feature learning on the foreground person regions. In [74,44,63,46,51,73], body part-specific CNNs are learned by means of off-the-shelf pose detectors. In [26,23,75], CNNs are branched to learn representations of global and local image regions.…”
Section: Related Work Deep Reid Architecturesmentioning
confidence: 99%
“…When realizing the limitation of purely global feature learning, many attempts to local feature learning haven arisen. Some methods [7], [6], [33], [18], [34] refer to external clues of pose estimation or body part parsing to extract body part features of persons. [7], [6] utilize the structural part by pose estimation prediction to form relatively precise local region proposals for further representations.…”
Section: Related Workmentioning
confidence: 99%