2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.00855
|View full text |Cite
|
Sign up to set email alerts
|

Correlation-Aware Deep Tracking

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 115 publications
(22 citation statements)
references
References 39 publications
0
22
0
Order By: Relevance
“…For example, inspired by Transformers, TransT [6] uses attentionbased feature fusion to combine features of the object template and search image. More recently, several works utilize Transformers as direct predictors to achieve a new state of the art, such as STARK [71], ToMP [47] and SBT [70]. These models tokenize frame features from a ResNet [25] encoder, and use a Transformer to predict the bounding box and object presence score with the feature tokens.…”
Section: Single Object Tracking Methodologiesmentioning
confidence: 99%
“…For example, inspired by Transformers, TransT [6] uses attentionbased feature fusion to combine features of the object template and search image. More recently, several works utilize Transformers as direct predictors to achieve a new state of the art, such as STARK [71], ToMP [47] and SBT [70]. These models tokenize frame features from a ResNet [25] encoder, and use a Transformer to predict the bounding box and object presence score with the feature tokens.…”
Section: Single Object Tracking Methodologiesmentioning
confidence: 99%
“…In recent years, some 2D one-stream frameworks [33], [34] have been proposed to perform their unique character in 2D object tracking. Ye et al [33] released some Siamesebased framework's problem: the extracted features lack the awareness of the target and have limited target-background discriminability.…”
Section: D One-stream Trackingmentioning
confidence: 99%
“…To tackle these issues, Ye et alintroduced a one-stream tracking framework which can bridge the templatesearch image pairs with bidirectional information flows. Single Branch Transformer (SBT) [34] suppresses non-target features and obtain instance-varying features by extensively matching the features of the two images through one-stream backbone. To sum up, one-stream frameworks possess the ability to extract more discriminative features than ordinary Siamese frameworks.…”
Section: D One-stream Trackingmentioning
confidence: 99%
“…Xie et al [36] found that the features extracted by Siamese-like networks cannot completely distinguish between the tracked target and the distractor objects. Therefore, by deeply embedding cross-image feature correlation in multiple layers of the feature network, a new target-dependent feature network is proposed, and it allows the features of the search and template images to be deeply fused for tracking.…”
Section: Related Workmentioning
confidence: 99%