2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.00252
|View full text |Cite
|
Sign up to set email alerts
|

Meta Distribution Alignment for Generalizable Person Re-Identification

Abstract: Domain generalization person re-identification (DG-ReID) aims to train a model on source domains and generalize well on unseen domains. Vision Transformer usually yields better generalization ability than common CNN networks under distribution shifts. However, Transformerbased ReID models inevitably over-fit to domain-specific biases due to the supervised learning strategy on the source domain. We observe that while the global images of different IDs should have different features, their similar local parts (e… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 44 publications
(9 citation statements)
references
References 53 publications
0
9
0
Order By: Relevance
“…Its architecture, which relies on self-attention mechanisms, allows it to process entire sequences of data in parallel and capture long-range dependencies, making it highly efficient and scalable compared to traditional recurrent neural networks (RNNs) and convolutional neural networks (CNNs) [17,18]. At present, transformers [19,20] are being utilized across various computer vision tasks, garnering considerable interest from researchers for their robust modeling capabilities. In…”
Section: Related Workmentioning
confidence: 99%
“…Its architecture, which relies on self-attention mechanisms, allows it to process entire sequences of data in parallel and capture long-range dependencies, making it highly efficient and scalable compared to traditional recurrent neural networks (RNNs) and convolutional neural networks (CNNs) [17,18]. At present, transformers [19,20] are being utilized across various computer vision tasks, garnering considerable interest from researchers for their robust modeling capabilities. In…”
Section: Related Workmentioning
confidence: 99%
“…The global features and local features are fused by a multiscale channel attention module (MSCAM) [16].…”
Section: Fusion Featuresmentioning
confidence: 99%
“…Li et al [15] introduced the concept of stream shape regularization, which achieves stream shape alignment and balanced distribution adaptation by weighting the distributional importance of the source and target domains and preserving the neighborhood structure. Moreover, a novel Meta Distribution Alignment (MDA) method [16] is proposed to enable them to share similar distribution in a test-time training fashion. Ni et al [17] propose a pure Transformer model (termed Part-aware Transformer) for DG-ReID by designing a proxy task to mine local visual information shared by different IDs.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Group-aware label transfer (GLT) (20) was proposed for unsupervised domain adaptive person Re-ID; a similar unsupervised person Re-ID method was presented in (21) that focuses on considering the distribution discrepancy between cameras. A meta-distribution alignment strategy has been proposed for domain generalizable person ReID (22). Inspired by the latest advances in transformers, a new part-aware transformer (PAT) was developed for the occluded person Re-ID in (23) through a transformer encoder-decoder architecture.…”
Section: Related Workmentioning
confidence: 99%