Y. H. Li scite author profile

The exponential growth of the internet has led to a great deal of interest in developing useful and efficient tools and software to assist users in searching the Web. Document retrieval, categorization, routing and filtering can all be formulated as classification problems. However, the complexity of natural languages and the extremely high dimensionality of the feature space of documents have made this classification problem very difficult. We investigate four different methods for document classification: the naive Bayes classifier, the nearest neighbour classifier, decision trees and a subspace method. These were applied to seven-class Yahoo news groups (business, entertainment, health, international, politics, sports and technology) individually and in combination. We studied three classifier combination approaches: simple voting, dynamic classifier selection and adaptive classifier combination. Our experimental results indicate that the naive Bayes classifier and the subspace method outperform the other two classifiers on our data sets. Combinations of multiple classifiers did not always improve the classification accuracy compared to the best individual classifier. Among the three different combination approaches, our adaptive classifier combination method introduced here performed the best. The best classification accuracy that we are able to achieve on this seven-class problem is approximately 83%, which is comparable to the performance of other similar studies. However, the classification problem considered here is more difficult because the pattern classes used in our experiments have a large overlap of words in their corresponding documents.

show abstract

Data mining of social media for urban resilience study: A case of rainstorm in Xi'an

Zhou

et al. 2023

International Journal of Disaster Risk Reduction

View full text Add to dashboard Cite

Compact Dominant Synergistic Excitation Pattern Learning for Illumination-Insensitive Image Representation With Boosting

et al. 2019

View full text Add to dashboard Cite

Illumination-insensitive image representation is a great challenge in the computer vision field. Illumination variations considerably obstruct the effectiveness of image feature extraction. In this paper, we present a novel and generalized learning framework for illumination-insensitive image representation, which can learn the discriminative features through maximizing the inter-difference and minimizing intradifference of the images with boosting. Particularly, we enhance the discriminative capacity of illuminationinsensitive image representation in three aspects. First, we learn a subset of different synergistic Weber excitation patterns (SWEP) to generate the dominant SWEP (DSWEP) and DSWEP codebook for exploring optimal illumination-insensitive patterns. Second, a compact DSWEP (C-DSWEP) is learned with a boosted set of weight to generate C-DSWEP codebook. Discriminative learning is aimed at robustness and compactness. Third, the discriminative histogram learning model is established for encoding CDSEP to further improve the discriminative ability and reduce redundancy. The extensive experiments on CMUPIE, FERET, Yale B, Yale B ext., LFW, and PhoTex databases have highlighted the superiority and the robustness of our method compared with some other state-of-the-art methods.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Y. H. Li

Classification of Text Documents

Data mining of social media for urban resilience study: A case of rainstorm in Xi'an

Compact Dominant Synergistic Excitation Pattern Learning for Illumination-Insensitive Image Representation With Boosting

Contact Info

Product

Resources

About