2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) 2017
DOI: 10.1109/avss.2017.8078460
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Region bilinear convolutional neural networks for person re-identification

Abstract: In this work we propose a new architecture for person reidentification. As the task of re-identification is inherently associated with embedding learning and non-rigid appearance description, our architecture is based on the deep bilinear convolutional network (Bilinear-CNN) that has been proposed recently for fine-grained classification of highly non-rigid objects. While the last stages of the original Bilinear-CNN architecture completely removes the geometric information from consideration by performing orde… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
81
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 122 publications
(81 citation statements)
references
References 28 publications
0
81
0
Order By: Relevance
“…Part-based algorithms: By performing bilinear pooling in a more local way, an embedding can be learned, in which each pooling is confined to a predefined region [25]. Inspired by attention models, in [16,14,21], the attention-based deep neural networks are proposed to capture multiple attentions and select multi-scale attentive features.…”
Section: Related Workmentioning
confidence: 99%
“…Part-based algorithms: By performing bilinear pooling in a more local way, an embedding can be learned, in which each pooling is confined to a predefined region [25]. Inspired by attention models, in [16,14,21], the attention-based deep neural networks are proposed to capture multiple attentions and select multi-scale attentive features.…”
Section: Related Workmentioning
confidence: 99%
“…This strategy has been widely adopted in fine-grained recogni- tion [53][54][55] and shows promising performance. For person re-identification, Ustinova et al [56] adopted a bilinear pooling to aggregate two different appearance maps; this method does not generate part-aligned representations and leads to poor performance. Our approach uses a bilinear pooling to aggregate appearance and part maps to compute part-aligned representations.…”
Section: Related Workmentioning
confidence: 99%
“…Wu et al [79] improved performance based on Ahmed's idea by using a deeper architecture and a new optimization method. Other deep network structures such as [69] and [65] have been designed which also effectively solved the ReID problem on older ReID datasets. Qui et al [54] attempted to perform facial ReID by using domain adaptation methods to reconcile different facial poses; however, their experiments were performed on the Multi-PIE [15] dataset, in which face images have controlled poses and illuminations.…”
Section: B Face Re-identificationmentioning
confidence: 99%