2015
DOI: 10.1109/tip.2015.2403231
|View full text |Cite
|
Sign up to set email alerts
|

Video Tracking Using Learned Hierarchical Features

Abstract: In this paper, we propose an approach to learn hierarchical features for visual object tracking. First, we offline learn features robust to diverse motion patterns from auxiliary video sequences. The hierarchical features are learned via a two-layer convolutional neural network. Embedding the temporal slowness constraint in the stacked architecture makes the learned features robust to complicated motion transformations, which is important for visual object tracking. Then, given a target video sequence, we prop… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
78
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 177 publications
(80 citation statements)
references
References 65 publications
2
78
0
Order By: Relevance
“…Li et al 23 applied a single-CNN on visual tracking tasks without pretraining and combined it with multiple image cues to improve the tracking success rate. Wang et al 24 used hierarchical features for tracking by training a two-layer CNN on an auxiliary dataset and gained a good result in complicated tracking situations. Zhang et al 25 proposed a convolutional network-based tracker (CNT), which combined the local structure feature and global geometric information of tracking targets and attained a state-of-the-art performance.…”
Section: Related Workmentioning
confidence: 99%
“…Li et al 23 applied a single-CNN on visual tracking tasks without pretraining and combined it with multiple image cues to improve the tracking success rate. Wang et al 24 used hierarchical features for tracking by training a two-layer CNN on an auxiliary dataset and gained a good result in complicated tracking situations. Zhang et al 25 proposed a convolutional network-based tracker (CNT), which combined the local structure feature and global geometric information of tracking targets and attained a state-of-the-art performance.…”
Section: Related Workmentioning
confidence: 99%
“…Wang and Yeung [34] propose an autoencoder based tracking method. Instead of using unrelated images for pretraining, Wang et al [57] propose a tracking method which prelearns features robust to diverse motion patterns from auxiliary video sequences. However, they only evaluate the method on 10 video sequences.…”
Section: Related Workmentioning
confidence: 99%
“…With the development of deep learning technology, convolutional neural networks (CNNs) have demonstrated their outstanding representation power in a wide range of computer vision applications [11]. And some tracking algorithms using the representations from CNNs have been proposed [12], [13], [22]. In addition, Ma et al [14] utilized convolutional features and learned correlation filters on each CNN layer without re-training.…”
Section: Introductionmentioning
confidence: 99%