Hannes Fassold scite author profile

The automatic detection and tracking of general objects (such as persons, animals or cars), text and logos in a video is crucial for many video understanding tasks, and real-time processing is usually required. We propose OmniTrack, an efficient and robust algorithm that automatically detects and tracks objects, text and brand logos in real time. It combines a powerful deep-learning-based object detector (YoloV3) with high-quality optical flow methods. Starting from the reference YoloV3 C++ implementation, we made several important performance optimizations, which are described. The major steps in the training procedure for the combined text and logo detector are presented. We then describe the OmniTrack algorithm, which consists of the phases preprocessing, feature calculation, prediction, matching and update. Several performance optimizations have been implemented there as well, such as running the object detection and the optical flow calculation asynchronously. Experiments show that the proposed algorithm runs in real time for standard-definition (720x576) video on a PC with a Quadro RTX 5000 GPU.
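
The phases named in the abstract (prediction, matching, update) and the asynchronous detection and optical flow calculation can be illustrated with a minimal Python sketch. This is not the authors' implementation: `detect_objects` and `compute_flow` are hypothetical stand-ins that read precomputed values from a frame dictionary, the flow is reduced to a single global displacement, and the greedy IoU matching with a 0.5 threshold is an assumption chosen for brevity.

```python
from concurrent.futures import ThreadPoolExecutor


def detect_objects(frame):
    """Hypothetical stand-in for the detector: returns (x0, y0, x1, y1) boxes."""
    return list(frame["boxes"])


def compute_flow(prev_frame, frame):
    """Hypothetical stand-in for optical flow: one global (dx, dy) per frame."""
    return frame["motion"]


def shift(box, flow):
    """Prediction step: move a box by the estimated displacement."""
    x0, y0, x1, y1 = box
    dx, dy = flow
    return (x0 + dx, y0 + dy, x1 + dx, y1 + dy)


def iou(a, b):
    """Intersection over union of two axis-aligned boxes."""
    iw = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0


def track(frames, iou_threshold=0.5):
    """Return, for every frame, a dict mapping track id to box."""
    tracks, next_id, prev, history = {}, 0, None, []
    with ThreadPoolExecutor(max_workers=2) as pool:
        for frame in frames:
            # Detection and flow are submitted together so they can overlap.
            det_future = pool.submit(detect_objects, frame)
            flow_future = (pool.submit(compute_flow, prev, frame)
                           if prev is not None else None)
            detections = det_future.result()
            flow = flow_future.result() if flow_future else (0, 0)

            # Prediction: propagate existing tracks with the flow estimate.
            predicted = {tid: shift(box, flow) for tid, box in tracks.items()}

            # Matching: greedy assignment of the best-overlapping detection.
            updated, unmatched = {}, list(detections)
            for tid, pbox in predicted.items():
                best = max(unmatched, key=lambda d: iou(pbox, d), default=None)
                if best is not None and iou(pbox, best) >= iou_threshold:
                    updated[tid] = best
                    unmatched.remove(best)

            # Update: unmatched detections start new tracks; in this sketch,
            # tracks without a matching detection are simply dropped.
            for det in unmatched:
                updated[next_id] = det
                next_id += 1

            tracks, prev = updated, frame
            history.append(dict(tracks))
    return history
```

Calling `track` on a sequence of frame dictionaries with `"boxes"` and `"motion"` keys returns one id-to-box mapping per frame. A box that moves consistently with the reported motion keeps its id, while a detection that does not overlap any predicted box starts a new track.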


A real-time GPU implementation of the SIFT algorithm for large-scale video analysis tasks

Fassold, Rosner. 2015

FastHebb: Scaling Hebbian Training of Deep Neural Networks to ImageNet Level

Lagani, Gennaro, Fassold, et al. 2022


Hannes Fassold

A Perceptual Image Sharpness Metric Based on Local Edge Gradient Analysis

Real-time Person Tracking in High-resolution Panoramic Video for Automated Broadcast Production

OmniTrack: Real-Time Detection and Tracking of Objects, Text and Logos in Video
