2022
DOI: 10.1002/cpe.7246
|View full text |Cite
|
Sign up to set email alerts
|

Disentangling classification and regression in Siamese‐based network for visual tracking

Abstract: Summary Siamese‐based trackers have made great progress in visual tracking community, however, the shared structure of network between classification and regression tasks limits the ability of the trackers to obtain more robust classification prediction and more accurate regression prediction. In this paper, we propose an effective visual tracking framework (named Siamese Disentangled Tracking‐Head, SiamDTH), which disentangles classification and regression in Siamese‐based network for visual tracking from two… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 34 publications
0
4
0
Order By: Relevance
“…Achieving optimal accuracy for other document types remains a challenge. The model's performance was relatively lower for less important documents, such as passports, bank books, and driver's licenses (see examples in Figures 22,21,and 23). because of the differences in document templates, data availability, data diversity, and complexities specific to each document type.…”
Section: ) End-to-end Results From All Servicesmentioning
confidence: 99%
See 1 more Smart Citation
“…Achieving optimal accuracy for other document types remains a challenge. The model's performance was relatively lower for less important documents, such as passports, bank books, and driver's licenses (see examples in Figures 22,21,and 23). because of the differences in document templates, data availability, data diversity, and complexities specific to each document type.…”
Section: ) End-to-end Results From All Servicesmentioning
confidence: 99%
“…More examples of using multiple layers from the FPN, Zhang et al [23] used multiple layers of FPN. Utilizing multiple layers in this study allowed the network to acquire hierarchical representations.…”
Section: B Multiple Layers-based Detectormentioning
confidence: 99%
“…Reference [42] proposes an active pedestrian detector to address the performance degradation problem of smallsized pedestrian detection by leveraging the rich feature hierarchy and initial pedestrian proposals provided by ResNet and Faster R-CNN, which operate explicitly on multilayer neural representations, achieving significant false detection rate reduction. Reference [43] An effective visual tracking framework called SiamDTH is proposed, which distinguishes in Siamese-based network-based visual tracking through feature decoupling and different tracking head structures. Classification and regression tasks, thereby obtaining more robust classification predictions and accurate regression predictions.…”
Section: E Pedestrian Detectionmentioning
confidence: 99%
“…O BJECT detection [1], as a longstanding, fundamental and challenging problem in computer vision [2], has been an active field of research for several decades [3], [4]. The task of object detection is to identify object categories and predict the location of each object in an image by a bounding box, and there are many real world applications [5] based on this task, such as face detection and pedestrian detection [6].…”
Section: Introductionmentioning
confidence: 99%