Automatic 1 monitoring for the assessment of pain can significantly improve the psychological comfort of patients. Recently introduced databases with expert annotation opened the way for pain intensity estimation from facial analysis. In this contribution, pivotal face elements are identified using the Histograms of Topographical features (HoT) which are a generalization of the topographical primal sketch. In order to improve the discrimination between different pain intensity values and respectively the generalization with respect to the monitored persons, we transfer data representation from the emotion oriented Cohn-Kanade database to the UNBC McMaster Shoulder Pain database.
This paper focuses on the possible embedding of the uncertainty regarding the colors of an image into histogramtype descriptors. The uncertainty naturally arises from both the quantization of the color components and the human perception of colors. Fuzzy histograms measure the typicality of each color within the image. We define various fuzzy color histograms following a taxonomy that classifies fuzzy techniques as crude fuzzy, fuzzy paradigm based, fuzzy aggregational and fuzzy inferential. For these fuzzy sets we must develop appropriate similarity measures and distances. We propose a class of such distances, derived from the fuzzy set equality and which we particularize according to various T-norms (fuzzy logical "or" operators). We also prove that the L1 metric naturally arises as a distance for fuzzy sets, considering the fuzzy set symmetric difference.
Abstract. In this paper, we propose an audio-visual approach to video genre categorization. Audio information is extracted at block-level, which has the advantage of capturing local temporal information. At temporal structural level, we asses action contents with respect to human perception. Further, color perception is quantified with statistics of color distribution, elementary hues, color properties and relationship of color. The last category of descriptors determines statistics of contour geometry. An extensive evaluation of this multi-modal approach based on on more than 91 hours of video footage is presented. We obtain average precision and recall ratios within [87% − 100%] and [77% − 100%], respectively, while average correct classification is up to 97%. Additionally, movies displayed according to feature-based coordinates in a virtual 3D browsing environment tend to regroup with respect to genre, which has potential application with real content-based browsing systems.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.