2019
DOI: 10.3390/electronics8101084
|View full text |Cite
|
Sign up to set email alerts
|

Fully Convolutional Single-Crop Siamese Networks for Real-Time Visual Object Tracking

Abstract: The visual object tracking problem seeks to track an arbitrary object in a video, and many deep convolutional neural network-based algorithms have achieved significant performance improvements in recent years. However, most of them do not guarantee real-time operation due to the large computation overhead for deep feature extraction. This paper presents a single-crop visual object tracking algorithm based on a fully convolutional Siamese network (SiamFC). The proposed algorithm significantly reduces the comput… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 33 publications
0
4
0
Order By: Relevance
“…[6] further improves tracking speed by mapping feature calculations into Fourier space. With the development of deep learning, the discrimination ability of deep features has been improved, [15,16] takes advantage of deep features to track objects. To improve the accuracy of the model, [17,18] combines the deep features of different layers with semantic and spatial information while [7] combines hand-craft and deep features to enhance the discriminative ability of the model.…”
Section: Correlation Filter-based Methodsmentioning
confidence: 99%
“…[6] further improves tracking speed by mapping feature calculations into Fourier space. With the development of deep learning, the discrimination ability of deep features has been improved, [15,16] takes advantage of deep features to track objects. To improve the accuracy of the model, [17,18] combines the deep features of different layers with semantic and spatial information while [7] combines hand-craft and deep features to enhance the discriminative ability of the model.…”
Section: Correlation Filter-based Methodsmentioning
confidence: 99%
“…The two-dimensional max-pooling engine is implemented by employing the multiple one-dimensional rank-tracking-based max-pooling engines, as depicted in Figure 6. The block marked with "M H " represents the horizontal one-dimensional max-pooling engine shown in Equation (2). Specifically, y p (i,j) is obtained from the highest-ranking value r 0 of the ranking-counting block illustrated in Figure 2.…”
Section: Multiplexer Switch (Ms) Multiplexermentioning
confidence: 99%
“…Convolutional neural networks (CNNs) have demonstrated remarkable performance in various domains, including image classification, object detection, and speech recognition [1,2]. However, effectively integrating CNNs into embedded systems with limited power and size requirements remains a significant challenge.…”
Section: Introductionmentioning
confidence: 99%
“…Subsequently, several novel convolution kernel design methods have been proposed in quick succession. Howard et al [37] proposed Depthwise Separable Convolution, which integrates traditional convolution into two steps, namely depthwise convolution and pointwise convolution, which greatly improves the calculation efficiency [38][39][40]. Zhang et al [41] introduced group convolution, then followed by an operation of channel shuffling.…”
Section: Pedestrian Detection With Cnnsmentioning
confidence: 99%