Abstract:Over the past couple of years, tremendous progress has been made in applying deep learning (DL) techniques to computer vision. Especially, deep convolutional neural networks (DCNNs) have achieved state-of-the-art performance on standard recognition datasets and tasks such as ImageNet Large-Scale Visual Recognition Challenge (ILSVRC). Among them, GoogLeNet network which is a radically redesigned DCNN based on the Hebbian principle and scale invariance set the new state of the art for classification and detection in the ILSVRC 2014. Since there exist various deep learning techniques, this review paper is focusing on techniques directly related to DCNNs, especially those needed to understand the architecture and techniques employed in GoogLeNet network.
This paper proposes an extension of the weak classifiers derived from the Haar-like features for their use in the Viola-Jones object detection system. These weak classifiers differ from the traditional single threshold ones, in that no specific threshold is needed and these classifiers give a more general solution to the non-trivial task of finding thresholds for the Haar-like features. The proposed quadratic discriminant analysis based extension prominently improves the ability of the weak classifiers to discriminate objects and non-objects. The proposed weak classifiers were evaluated by boosting a single stage classifier to detect rear of car. The experiments demonstrate that the object detector based on the proposed weak classifiers yields higher classification performance with less number of weak classifiers than the detector built with traditional single threshold weak classifiers.
The death rate of pedestrians in car accidents in Korea is about 2.5 times higher than the average of OECD countries'. If a system that can detect pedestrians and send alarm to driver is built and reduces the rate, it is worth developing such a pedestrian detection system (PDS). Since the accident rate in which pedestrians are involved is higher at nighttime than in daytime, the adoption of nighttime PDS is being standardized by big auto companies. However, they are usually using expensive night visions or multiple sensors for their PDS. In this paper we propose a method for nighttime PDS using a monochrome visible spectrum camera. We could verify its superiority in both performance and real-time operation to existing algorithm through tests against video data taken in several different environments.
This paper presents a real-time detection of on-road succeeding vehicles based on low level edge features and a boosted cascade of Haar-like features. At first, the candidate vehicle location in an image is found by low level horizontal edge and symmetry characteristic of vehicle. Then a boosted cascade of the Haar-like features is applied to the initial hypothesized vehicle location to extract the refined vehicle location. The initial hypothesis generation using simple edge features speeds up the whole detection process and the application of a trained cascade on the hypothesized location increases the accuracy of the detection process. Experimental results on real world road scenario with processing speed of up to 27 frames per second for 720x480 pixel images are presented.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.