The Siamese tracker consists of two components: a classification and a
regression networks. Despite their different roles, most Siamese
trackers have similar feature fusion modules in the two networks,
leading to the neglect of their unique characteristics. In this work, we
experimentally discover that the two networks place different levels of
emphasis on different types of information. Specifically, regression
tends to rely on semantic information, while classification places more
emphasis on global information. Therefore, we propose a new tracking
structure named SGTrack, which includes a semantic augmentation fusion
(SAF) for regression and a global relevance fusion (GRF) for
classification. It allows us to unlock the full potential of both
networks. The experimental results of our method on five benchmarks
provide evidence of a notable improvement in tracking performance, while
preserving real-time speed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.