Attribute-based vehicle search in crowded surveillance videos

Feris, Rogério; Siddiquie, Behjat; Zhai, Yun; Petterson, James; Brown, Lisa M.; Pankanti, Sharath

doi:10.1145/1991996.1992014

Cited by 38 publications

(37 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The work of Feris et al [1], [5] also splits the training data into motionlet clusters to better deal with non-linearities in the dataset. Training in each cluster is done with largescale feature selection, but only few thousands of training examples are considered.…”

Section: Related Workmentioning

confidence: 99%

“…The first step of our learning algorithm consists of automatically partitioning the dataset into motionlet clusters [1], i.e., clusters of vehicle images that share similar 2D motion direction. The motion information of a vehicle is directly related to its pose, therefore this operation provides a semantic partitioning of the dataset.…”

Section: A Pool Of Complementary Detectorsmentioning

confidence: 99%

“…In this context, we are particularly interested in enabling automatic object retrieval based on attributes from surveillance videos, with focus on vehicles. We have built a search infrastructure similar to the work of Feris et al [1], involving video processing, database ingestion, and web service interface to support user queries such as "Show me all blue vehicles traveling at high speed northbound, last Saturday, from 2pm to 4pm". This paper addresses an important component of this framework: how to develop a robust and efficient approach for vehicle detection in surveillance videos, under varying pose and lighting changes.…”

Section: Introductionmentioning

confidence: 99%

“…Figure 1 shows the architecture of our proposed system. We use motionlets [1] to automatically split our training dataset into semantic partitions related to vehicle pose. For each partition, we create a set of compact, complementary detectors, each trained in a deep cascade structure, using hundreds of thousands of selected negative examples.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance

Feris¹,

Pankanti²,

Siddiquie

2012

2012 IEEE International Conference on Multimedia and Expo

Self Cite

View full text Add to dashboard Cite

Abstract-We address the problem of learning robust and efficient multi-view object detectors for surveillance video indexing and retrieval. Our philosophy is that effective solutions for this problem can be obtained by learning detectors from huge amounts of training data. Along this research direction, we propose a novel approach that consists of strategically partitioning the training set and learning a large array of complementary, compact, deep cascade detectors. At test time, given a video sequence captured by a fixed camera, a small number of detectors is automatically selected per image location. We demonstrate our approach on the problem of vehicle detection in challenging surveillance scenarios, using a large training dataset composed of around one million images. Our system runs at an impressive average rate of 125 frames per second on a conventional laptop computer.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: A Pool Of Complementary Detectorsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance

Feris¹,

Pankanti²,

Siddiquie

2012

2012 IEEE International Conference on Multimedia and Expo

Self Cite

View full text Add to dashboard Cite

show abstract

“…Specifically for applications that require real-time processing, cascade detectors based on Haar-like features have been widely used for detection of faces [22], pedestrians [23] and vehicles [6]. Although significant progress has been made in this area, state-of-the-art object detectors are still not able to generalize well to different camera angles and lighting conditions.…”

Section: Introductionmentioning

confidence: 99%

Boosting object detection performance in crowded surveillance videos

Feris

Datta

Pankanti

et al. 2013

2013 IEEE Workshop on Applications of Computer Vision (WACV)

Self Cite

View full text Add to dashboard Cite

We present a novel approach to automatically create efficient and accurate object detectors tailored to work well on specific video surveillance cameras (specific-domain detectors), using samples acquired with the help of a more expensive, general-domain detector (trained using images from multiple cameras). Our method requires no manual labels from the target domain. We automatically collect training data using tracking over short periods of time from high-confidence samples selected by the general-domain detector. In this context, a novel confidence measure is proposed for detectors based on a cascade of classifiers, which are frequently adopted for computer vision applications that require real-time processing. We demonstrate our proposed approach on the problem of vehicle detection in crowded surveillance videos, showing that an automatically generated detector significantly outperforms the original general-domain detector with much less feature computations.

show abstract