2019
DOI: 10.1109/access.2019.2922494
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive Weighted CNN Features Integration for Correlation Filter Tracking

Abstract: Visual object tracking is an active and challenging research topic in computer vision, as objects often undergo significant appearance changes caused by occlusion, deformation, and background clutter. Although convolutional neural network (CNN)-based trackers have achieved competitive results, there are still some limitations. Most existing CNN-based trackers track the object by leveraging high-level semantic features of the highest convolutional layer, which may lead to low-spatial resolution feature maps and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2019
2019
2025
2025

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(8 citation statements)
references
References 51 publications
0
8
0
Order By: Relevance
“…Therefore, most previous attention models, e.g. [22], [41], [42], and [53], use either global average or max pooling with a multilayer perceptron (MLP) to calculate their gain. In contrast, rather than a single pooling operation, we consider the global average and max pooling together to construct a channel attention module that learns fused features.…”
Section: ) Channel Attentionmentioning
confidence: 99%
See 1 more Smart Citation
“…Therefore, most previous attention models, e.g. [22], [41], [42], and [53], use either global average or max pooling with a multilayer perceptron (MLP) to calculate their gain. In contrast, rather than a single pooling operation, we consider the global average and max pooling together to construct a channel attention module that learns fused features.…”
Section: ) Channel Attentionmentioning
confidence: 99%
“…Previously, Qin et al [54] constructed a spatial mask using super-pixels to exploit target representation. Li et al [53] utilized global max pooling to encode the spatial attention in their model. We exploit the relationship among channels inter-spatial features to construct spatial attention.…”
Section: ) Channel Attentionmentioning
confidence: 99%
“…The global and local features can be obtained by controlling the side length of the triangular. Convolution neural network (CNN) has also been adopted in shape matching and retrieval [30], [31]. Li and Yang [31] designed an adaptive weighted CNN features-based Siamese network.…”
Section: Related Workmentioning
confidence: 99%
“…Convolution neural network (CNN) has also been adopted in shape matching and retrieval [30], [31]. Li and Yang [31] designed an adaptive weighted CNN features-based Siamese network. In this method, the feature extraction network is designed to derive feature maps while the feature integration network is adopted to adaptively assign weights to each region of the feature maps and integrate feature maps.…”
Section: Related Workmentioning
confidence: 99%
“…C ONVOLUTIONAL neural networks (CNNs) are attracting considerable attention in an increasing array of area, such as computer vision [1]- [3], computational acoustics [4]- [6] and natural language processing [7]- [9]. The general trend is to design deeper and more complicated network architecture to pursue better performance.…”
Section: Introductionmentioning
confidence: 99%