2017
DOI: 10.1109/tnnls.2016.2586194
|View full text |Cite
|
Sign up to set email alerts
|

A Biologically Inspired Appearance Model for Robust Visual Tracking

Abstract: Abstract-In this work, we propose a biologically inspired appearance model for robust visual tracking. Motivated in part by the success of the hierarchical organization of the primary visual cortex (area V1), we establish an architecture consisting of five layers: whitening, rectification, normalization, coding and polling. The first three layers stem from the models developed for object recognition. In this paper, our attention focuses on the coding and pooling layers. In particular, we use a discriminative s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
42
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 86 publications
(42 citation statements)
references
References 66 publications
0
42
0
Order By: Relevance
“…The initial appearance is preserved in a network model if the target samples drawn from the first frame are put into consideration when updated, which is helpful for target re-detection after its reappearance after temporary disappearance. Pessimistically believed in [46], from the author's point of view, only the positive sample from the first frame is completely reliable, whereas contamination and decision mistake must exist in other frames to some extent, which is also deemed to be true in [47] that there must exist error a bit or too much in each frame, except in the first one. However, optimistically speaking, thanks to the close appearances from the two adjacent frames, a trend of the variation can be foreseen within a small period (no above than three frames); therefore, there exists a high confidence of making sure whether the tracking result is responsible.…”
Section: Update Strategy Of Neural Network Modelsmentioning
confidence: 99%
“…The initial appearance is preserved in a network model if the target samples drawn from the first frame are put into consideration when updated, which is helpful for target re-detection after its reappearance after temporary disappearance. Pessimistically believed in [46], from the author's point of view, only the positive sample from the first frame is completely reliable, whereas contamination and decision mistake must exist in other frames to some extent, which is also deemed to be true in [47] that there must exist error a bit or too much in each frame, except in the first one. However, optimistically speaking, thanks to the close appearances from the two adjacent frames, a trend of the variation can be foreseen within a small period (no above than three frames); therefore, there exists a high confidence of making sure whether the tracking result is responsible.…”
Section: Update Strategy Of Neural Network Modelsmentioning
confidence: 99%
“…The former focuses on modeling the appearance of the tracked target and then finds the candidate that is the most similar to the target template as the tracking result. The representative methods include those trackers based on sparse representation [23][24][25][26][27][28][29]. In [29], sparse coding is used to extract features from sampled patches.…”
Section: Related Workmentioning
confidence: 99%
“…Since features from different cues describe the tracked target from different aspects, more robust tracking results can be obtained when multi-cue features are used. In [23], a biologically inspired appearance model is proposed to model target appearance, which is also based on features extracted using sparse coding.…”
Section: Related Workmentioning
confidence: 99%
“…Later, that model is used to estimate affinities between the objects across different frames. The approaches following this pipeline exploit several distinct types of representation models, including; appearance models [10], [11], [12], [13], motion models [14], [15], [16], [17], and composite models [9], [18], [19]. The appearance models focus on computing easy-to-track object features that encode appearances of local regions of objects or their bounding boxes [20], [21], [22].…”
Section: Introductionmentioning
confidence: 99%