Multimedia content analysis is applied in different real-world computer vision applications, and digital images constitute a major part of multimedia data. In last few years, the complexity of multimedia contents, especially the images, has grown exponentially, and on daily basis, more than millions of images are uploaded at different archives such as Twitter, Facebook, and Instagram. To search for a relevant image from an archive is a challenging research problem for computer vision research community. Most of the search engines retrieve images on the basis of traditional text-based approaches that rely on captions and metadata. In the last two decades, extensive research is reported for content-based image retrieval (CBIR), image classification, and analysis. In CBIR and image classification-based models, high-level image visuals are represented in the form of feature vectors that consists of numerical values. The research shows that there is a significant gap between image feature representation and human visual understanding. Due to this reason, the research presented in this area is focused to reduce the semantic gap between the image feature representation and human visual understanding. In this paper, we aim to present a comprehensive review of the recent development in the area of CBIR and image representation. We analyzed the main aspects of various image retrieval and image representation models from low-level feature extraction to recent semantic deep-learning approaches. The important concepts and major research studies based on CBIR and image representation are discussed in detail, and future research directions are concluded to inspire further research in this area.
Recently, face datasets containing celebrities photos with facial makeup are growing at exponential rates, making their recognition very challenging. Existing face recognition methods rely on feature extraction and reference reranking to improve the performance. However face images with facial makeup carry inherent ambiguity due to artificial colors, shading, contouring, and varying skin tones, making recognition task more difficult. The problem becomes more confound as the makeup alters the bilateral size and symmetry of the certain face components such as eyes and lips affecting the distinctiveness of faces. The ambiguity becomes even worse when different days bring different facial makeup for celebrities owing to the context of interpersonal situations and current societal makeup trends. To cope with these artificial effects, we propose to use a deep convolutional neural network (dCNN) using augmented face dataset to extract discriminative features from face images containing synthetic makeup variations. The augmented dataset containing original face images and those with synthetic make up variations allows dCNN to learn face features in a variety of facial makeup. We also evaluate the role of partial and full makeup in face images to improve the recognition performance. The experimental results on two challenging face datasets show that the proposed approach can compete with the state of the art.
The requirement for effective image search, which motivates the use of Content-Based Image Retrieval (CBIR) and the search of similar multimedia contents on the basis of user query, remains an open research problem for computer vision applications. The application domains for Bag of Visual Words (BoVW) based image representations are object recognition, image classification and content-based image analysis. Interest point detectors are quantized in the feature space and the final histogram or image signature do not retain any detail about co-occurrences of features in the 2D image space. This spatial information is crucial, as it adversely affects the performance of an image classification-based model. The most notable contribution in this context is Spatial Pyramid Matching (SPM), which captures the absolute spatial distribution of visual words. However, SPM is sensitive to image transformations such as rotation, flipping and translation. When images are not well-aligned, SPM may lose its discriminative power. This paper introduces a novel approach to encoding the relative spatial information for histogram-based representation of the BoVW model. This is established by computing the global geometric relationship between pairs of identical visual words with respect to the centroid of an image. The proposed research is evaluated by using five different datasets. Comprehensive experiments demonstrate the robustness of the proposed image representation as compared to the state-of-the-art methods in terms of precision and recall values.
Recognition of age-separated face images is a challenging and open research problem. In this paper we propose a facial asymmetry based matching-score space (MSS) approach for recognition of age-separated face images.Motivated by its discriminatory information, we evaluate facial asymmetry across small and large temporal variations and use asymmetric facial features to recognize age-separated face images. We extract three different facial features including holistic feature descriptors using Principal Component Analysis (PCA), local feature descriptors using Local Binary Patterns (LBP), and Densely Sampled Asymmetric Features (DSAF) to represent face images. Then we develop MSS to discriminate genuine and imposter classes using support vector machine (SVM) as a classifier. Experimental results on three widely used face aging databases, the FERET, MORPH and FG-NET, show that proposed approach has superior performance compared to some existing state-of-the-art approaches.
India, Pakistan, and Bangladesh (IPB) are the largest South Asian countries in terms of land area, gross domestic product (GDP), and population. The growth in these countries is impacted by inadequate renewable energy policy and implementation over the years, resulting in slow progress towards human development and economic sustainability. These developing countries are blessed with huge potential for renewable energy resources; however, they still heavily rely on fossil fuels (93%). IPB is a major contributor to the total energy consumption of the world and its most energy-intensive building sector (India 47%, Pakistan 55% and Bangladesh 55%) displays inadequate energy performance. This paper comprehensively reviews the energy mix and consumption in IPB with special emphasis on current policies and its impact on economic and human development. The main performance indicators have been critically analyzed for the period 1970–2017. The strength of this paper is a broad overview on energy and development of energy integration in major South Asian countries. Furthermore, it presents a broad deepening on the main sector of energy consumption, i.e., the building sector. The paper also particularly analyzes the existing buildings energy efficiency codes and policies, with specific long-term recommendations to improve average energy consumption per person. The study also examines the technical and regulatory barriers and recommends specific measures to adapt renewable technologies, with special attention to policies affecting energy consumption. The analysis and results are general and can be applied to other developing countries of the world.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.