One class classification is a binary classification task for which only one class of samples is available for learning. In some preliminary works, we have proposed One Class Random Forests (OCRF), a method based on a random forest algorithm and an original outlier generation procedure that makes use of classifier ensemble randomization principles. In this paper, we propose an extensive study of the behavior of OCRF, that includes experiments on various UCI public datasets and comparison to reference one class algorithms -namely, gaussian density models, Parzen estimators, gaussian mixture models and One Class SVMs -with statistical significance. Our aim is to show that the randomization principles embedded in a random forest algorithm make the outlier generation process more efficient, and allow in particular to break the curse of dimensionality. One Class Random Forests are shown to perform well in comparison to other methods, and in particular to maintain stable performance in higher dimension, while the other algorithms may fail.
Recently, the in vivo imaging of pulmonary alveoli was made possible thanks to confocal microscopy. For these images, we wish to aid the clinician by developing a computer-aided diagnosis system, able to discriminate between healthy and pathological subjects. The lack of expertise currently available on these images has first led us to choose a generic approach, based on pixel-value description of randomly extracted subwindows and decision tree ensemble for classification (extra-trees). In order to deal with the great complexity of our images, we adapt this method by introducing a texture-based description of the subwindows, based on local binary patterns. We show through our experimental protocol that this adaptation is a promising way to classify fibered confocal fluorescence microscopy images. In addition, we introduce a rejection mechanism on the classifier output to prevent nondetection errors.
A novel imaging technique can now provide microscopic images of the distal lung in vivo, for which quantitative analysis tools need to be developed. In this paper, we present an image classification system that is able to discriminate between normal and pathological images. Different feature spaces for discrimination are investigated and evaluated using a support vector machine. Best classification rates reach up to 90% and 95% on non-smoker and smoker groups, respectively. A feature selection process is also implemented, that allows us to gain some insight about these images. Whereas further tests on extended databases are needed, these first results indicate that efficient computer based automated classification of normal vs. pathological images of the distal lung is feasible.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.