2016
DOI: 10.48550/arxiv.1607.05781
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Spatially Supervised Recurrent Convolutional Neural Networks for Visual Object Tracking

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 14 publications
0
7
0
Order By: Relevance
“…In vision-based convoying, the system may lose sight of the object momentarily due to occlusion or lighting changes, and thus lose track of its leading agent. In an attempt to address this problem, we use recurrent layers stacked on top of our ReducedYOLO architecture, similarly to [26]. In their work, Ning et al use the last layer of features output by the YOLO network for n frames (concatenated with the YOLO bounding box prediction which has the highest IOU with the ground truth) and feed them to single forward Long-Term Short-Term Memory Network (LSTM).…”
Section: B Recurrent Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In vision-based convoying, the system may lose sight of the object momentarily due to occlusion or lighting changes, and thus lose track of its leading agent. In an attempt to address this problem, we use recurrent layers stacked on top of our ReducedYOLO architecture, similarly to [26]. In their work, Ning et al use the last layer of features output by the YOLO network for n frames (concatenated with the YOLO bounding box prediction which has the highest IOU with the ground truth) and feed them to single forward Long-Term Short-Term Memory Network (LSTM).…”
Section: B Recurrent Methodsmentioning
confidence: 99%
“…2: Overview of our Recurrent ReducedYOLO (RROLO) architecture. The original ROLO work [26] did not use bidirectional, dense layers, or multiple LSTM cells in their experiments.…”
Section: B Recurrent Methodsmentioning
confidence: 99%
“…Another related work to ours is [21]. They proposed a spatially supervised recurrent convolutional neural network in which a YOLO network [22] is applied on each frame to produce object detections and a recurrent neural network is used to directly regress YOLO detections.…”
Section: Visual Object Trackingmentioning
confidence: 99%
“…More recently, as deep learning models have gained in popularity, solutions such as the gradient normal clipping strategy (Pascanu et al, 2013) have eased the overall implementation burden of RNNs. As RNNs have become more manageable from an estimation perspective, they have increasingly been used to model complicated sequential forecasting problems such as visual object tracking (Ning et al, 2016), speech recognition (Yildiz et al, 2013), and text generation (Graves, 2013), just to name a few. Simultaneously, RNNs have also seen a rise in usage in the dynamic systems literature due to their ability to replicate complex attractor dynamics that are often present in chaotic systems (Jaeger, 2001).…”
Section: Introductionmentioning
confidence: 99%