2023
DOI: 10.1109/tip.2023.3238642
|View full text |Cite
|
Sign up to set email alerts
|

GiT: Graph Interactive Transformer for Vehicle Re-Identification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
35
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 80 publications
(35 citation statements)
references
References 94 publications
0
35
0
Order By: Relevance
“…Unlike graph transformers being applied to general node classification, relatively few studies have applied graphs to ViTs for vision applications. Shen et al [ 17 ] proposed a graph interactive transformer (GiT) for vehicle reidentification. Using this method, the GiT is divided into two modules: the original transformer module for extracting powerful global patch features and a local correlation graph (LCG) module for extracting local features that are distinct within the patch.…”
Section: Related Studiesmentioning
confidence: 99%
See 1 more Smart Citation
“…Unlike graph transformers being applied to general node classification, relatively few studies have applied graphs to ViTs for vision applications. Shen et al [ 17 ] proposed a graph interactive transformer (GiT) for vehicle reidentification. Using this method, the GiT is divided into two modules: the original transformer module for extracting powerful global patch features and a local correlation graph (LCG) module for extracting local features that are distinct within the patch.…”
Section: Related Studiesmentioning
confidence: 99%
“…Unlike with other graph-based transformers [ 17 , 18 , 19 ], which apply graphs and attention in parallel and combine the outputs, this study is the first attempt to apply a graph inside the transformer head and replace MHA with a few GHA mechanisms. Moreover, there is no need for a class token in patch embedding, and thus the number of operations can be reduced.…”
Section: Introductionmentioning
confidence: 99%
“…EMRN 29 proposes a multi-resolution features dimension uniform module to fix dimensional features from images of varying resolutions, thus solving the multi-scale problem. Besides, GiT 30 uses a graph network approach to propose a structure where graphs and transformers interact constantly, enabling close collaboration between global and local features for vehicle Re-ID. The dual-relational attention module (DRAM) 31 models the importance of feature points in the spatial dimension and the channel dimension to form a three-dimensional attention module to mine more detailed semantic information.…”
Section: Related Work On the Vehicle Re-id Taskmentioning
confidence: 99%
“…For example, in the task of face recognition (Boutros et al, 2022 ), deep learning methods effectively capture the facial information under complex conditions, enabling accurate identification of individuals based on semantic attributes. Similarly, in vehicle re-identification (Shen et al, 2023 ), the metric learning framework facilitates reliable screening of complex multi-view positive samples, leading to precise consensus decision-making despite variations in multi-sensor data. A prominent network structure that implements the metric learning framework is the siamese neural network, exemplified by MatchNet (Han et al, 2015 ).…”
Section: Introductionmentioning
confidence: 99%