On the Interaction Between Deep Detectors and Siamese Trackers in Video Surveillance

Kiran, Madhu; Tiwari, Vivek; Nguyen-Meidine, Le Thanh; Morin, L.; Granger, Éric

doi:10.1109/avss.2019.8909864

Cited by 1 publication

(4 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In contrast to existing works e.g. [28,32] that follow a single target our goal is to give the same priority to all targets and attempt to follow as many as possible without a specific focus a single target. In addition, it does not require to be given an anchor target to follow and thus can be used in generic scenarios where the goal is to monitor targets in an area.…”

Section: Visual Active Monitoringmentioning

confidence: 99%

“…Over the last years, deep neural networks, especially Convolutional Neural Networks (CNN), have improved the state-of-the-art in static object tracking/monitoring [24,28,35,37]. Conventional solutions for active visual tracking tackle the problem by decomposing it into two or more subtasks [21], i.e., object detection typically using a machinelearning-based classifier/detector, a tracking algorithms such as Kalman filter [7], and a control output for the camera movement.…”

Section: Visual Active Monitoringmentioning

confidence: 99%

“…In summary, it is evident from the literature that related works make excessive use of multiple modules composed of hand-crafted models and rules that must be tuned separately and in most cases track only a single target [28]. While there has been considerable progression in utilizing deep learning for static camera tracking there has been relatively few works dealing with deep learning for active smart camera systems.…”

Section: Visual Active Monitoringmentioning

confidence: 99%

“…Currently existing approaches for active vision decompose the problem into separate modules, namely detection, tracking, and control and employ different algorithms for the purpose of detecting targets and then following them [17,28]. Such examples include motion detection, background modelling/subtraction, and lastly tracking by detection.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

$$\text{C}^{3}\text{Net}$$: end-to-end deep learning for efficient real-time visual active camera control

Kyrkou

2021

J Real-Time Image Proc

View full text Add to dashboard Cite

The need for automated real-time visual systems in applications such as smart camera surveillance, smart environments, and drones necessitates the improvement of methods for visual active monitoring and control. Traditionally, the active monitoring task has been handled through a pipeline of modules such as detection, filtering, and control. However, such methods are difficult to jointly optimize and tune their various parameters for real-time processing in resource constraint systems. In this paper a deep Convolutional Camera Controller Neural Network is proposed to go directly from visual information to camera movement to provide an efficient solution to the active vision problem. It is trained end-to-end without bounding box annotations to control a camera and follow multiple targets from raw pixel values. Evaluation through both a simulation framework and real experimental setup, indicate that the proposed solution is robust to varying conditions and able to achieve better monitoring performance than traditional approaches both in terms of number of targets monitored as well as in effective monitoring time. The advantage of the proposed approach is that it is computationally less demanding and can run at over 10 FPS ( ∼ 4× speedup) on an embedded smart camera providing a practical and affordable solution to real-time active monitoring.

show abstract

Section: Visual Active Monitoringmentioning

confidence: 99%