The idea of most trackers based on Siamese network is off-line training and online tracking. In fact, online tracking is conducted in terms of deep features, which are extracted from the predefined network trained on a large amount of data off-line. However, these features are the general representation for similar objects, and therefore, their discrimination ability is not enough to identify the current tracking target, particularly distractors, from the background. To tackle this problem, we propose to update the features extracted by a Siamese network online. These features can fit the target variations when tracking is on-thefly. Especially, we extract the common features from the shallow convolutional layers trained off-line, and then, they are employed as inputs of the deep convolutional layers to learn the special features of the current target online. Besides, an integrated updating strategy is proposed to accelerate network convergence. It is beneficial to enhance the discrimination ability of the learned features to identify the current target from the background and distractors. We conducted abundant experiments on the OTB2015 and VOT2016 databases. And the results demonstrate that our tracker effectively improves the baseline algorithm and performs favorably against most of the state-of-the-art trackers in the comparison of accuracy and robustness.INDEX TERMS Target tracking, Siamese network, offline training, online tracking.
Convolutional neural networks can efficiently exploit sophisticated hierarchical features which have different properties for visual tracking problem. In this paper, by using multilayer convolutional features jointly and constructing a scale pyramid, we propose an online scale adaptive tracking method. We construct two separate correlation filters for translation and scale estimations. The translation filters improve the accuracy of target localization by a weighted fusion of multiple convolutional layers. Meanwhile, the separate scale filters achieve the optimal and fast scale estimation by a scale pyramid. This design decreases the mutual errors of translation and scale estimations, and reduces computational complexity efficiently. Moreover, in order to solve the problem of tracking drifts due to the severe occlusion or serious appearance changes of the target, we present a new adaptive and selective update mechanism to update the translation filters effectively. Extensive experimental results show that our proposed method achieves the excellent overall performance compared with the state-of-the-art methods.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.