2019
DOI: 10.1109/tip.2019.2908065
|View full text |Cite
|
Sign up to set email alerts
|

Discriminative Feature Learning With Foreground Attention for Person Re-Identification

Abstract: The performance of person re-identification (Re-ID) has been seriously effected by the large cross-view appearance variations caused by mutual occlusions and background clutters. Hence learning a feature representation that can adaptively emphasize the foreground persons becomes very critical to solve the person Re-ID problem. In this paper, we propose a simple yet effective foreground attentive neural network (FANN) to learn a discriminative feature representation for person Re-ID, which can adaptively enhanc… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 63 publications
(13 citation statements)
references
References 70 publications
0
13
0
Order By: Relevance
“…bounding-boxes or masks of instances) than the overall count which is computational expensive and mostly suitable in lower density crowds. In overcrowded scenes, clutters and severe occlusions make it unfeasible to detect every single person, despite the progresses in related fields [19,10,47,17,33,43,61,50,48,34,49,55,60,52]. Secondly, training object detectors require bounding-box or instance mask annotations, which is much more labor-intensive in dense crowds.…”
Section: Related Workmentioning
confidence: 99%
“…bounding-boxes or masks of instances) than the overall count which is computational expensive and mostly suitable in lower density crowds. In overcrowded scenes, clutters and severe occlusions make it unfeasible to detect every single person, despite the progresses in related fields [19,10,47,17,33,43,61,50,48,34,49,55,60,52]. Secondly, training object detectors require bounding-box or instance mask annotations, which is much more labor-intensive in dense crowds.…”
Section: Related Workmentioning
confidence: 99%
“…Author in [35] proposed a Pyramid Person Matching Network to learn the correspondence of misalignment components in image pairs. Attention mechanisms are applied in many models [20][21][22][23] to focus on salient parts to extract more useful and discriminative information. Recent studies start to consider part-level features as complementary features for their models due to its finegrained information.…”
Section: B Cnn-based Re-idmentioning
confidence: 99%
“…In recent years, with the growth of convolutional neural networks (CNNs) and careful-annotated benchmarks, CNN-based models which learn deep features from data outperform hand-crafted methods by a large margin and achieve remarkable accuracy [12]. To obtain more discriminative features to deal with the inter-class challenge, some works try to extract local features from local regions in different ways, such as pose normalization [13][14][15], part-based learning [16][17][18][19], or attention mechanism [20][21][22][23]. Although these great works gain very high performance in accuracy and mAP, they are still only employed deep features, which do not contain semantic information and cannot be explained by human.…”
Section: Introductionmentioning
confidence: 99%
“…Person REID attracts much more research interest by the development of deep learning technology in recent years. The related models are classified as global-based [8], [9], and part-based feature representations [18]- [20].…”
Section: Related Work a Image Based Person Re-identificationmentioning
confidence: 99%
“…Most of the proposed methods make an assumption that all pedestrian images have sufficiently high resolution captured by a variety of cameras [8], [9]. However, the distances The associate editor coordinating the review of this manuscript and approving it for publication was Gianluigi Ciocca .…”
Section: Introductionmentioning
confidence: 99%