This paper presents a technique for ear detection from 2D profile face images that is capable of significantly reducing the false positives. In an ear biometrics system, recognition performance highly depends on the performance of the ear detection module. The trade-off between the complexity and the false positive detection is one of the essential component, where the complexity of a system increases proportionally to achieve a zero false positive rate detection.In literature, available ear detection techniques based on handcrafted features face challenges with low-quality acquired images affected by illumination, occlusion, and pose variations. We propose an ear detection technique using ensemble of convolutional neural network (CNN). The first part of the technique trains three models of CNN on a given dataset, whereas in later part, weighted average of the outputs of trained models is utilized to detect the ear regions. The used ensemble models show better performance as compared to the case when each individual model is used standalone. The proposed technique is being evaluated on two databases, viz, IIT Indore-Collection A (IIT-Col A) database and annotated web ear (AWE) database. Experimental results of ear detection demonstrate the superior performance of the proposed technique over other state-of-the-art techniques in handling illumination, occlusion, and pose variations. KEYWORDS deep learning, ear detection, ensemble model, unconstrained environment
INTRODUCTIONIn recent years, ear biometrics has been considered a reliable biometrics and has got attention over face mainly due to its invariant behavior to the age of a person. Biometrics systems based on face have to consider the changes occurring in the face with a change in age or expression.The ear of a person remains consistent in its shape and it is fixed in its position at the center of the profile image. To obtain reliable performance, face images are required to be captured in the controlled setting, while in case of ear, it can be captured with a foreseeable background due to its fixed side face background. An ear biometric system mainly consists of two processes, ie, Ear detection and Ear recognition. Majority of the ear biometrics techniques in the literature use manually cropped ear images to perform ear recognition process. Furthermore, the ear biometric techniques that crops ear automatically are able to detect ear only when the ear has small variations in the side face image. In cases of pose or scale variations, these techniques are found to be inefficient. Additionally, in a real-time environment, these techniques lack in being fully automatic approaches and speed.Performance of an ear detection technique based on handcrafted feature learning methods depends on the dataset used for learning the features. 1 These methods are prone to produce erroneous results when used with the images captured in unconstrained settings like different illumination, background, and pose. Tasks like extraction of suitable features along with the discriminati...