Sentiment lexicon learning is of paramount importance in sentiment analysis. One of the most considerable challenges in learning sentiment lexicons is their domain-specific behavior. Transferring knowledge acquired from a sentiment lexicon from one domain to another is an open research problem. In this study, we attempt to address this challenge by presenting a transfer learning approach that creates new learning insights for multiple domains of the same genre. We propose an unsupervised sentiment lexicon learning methodology scalable to new domains of the same genre. Incremental learning and the methodology learn polarity seed words from corpora of multiple automatically selected source domains. This process then transfers its genre-level knowledge of corpus-learned seed words to the target domains. The corpus-learned seed words are used for sentiment lexicon generation for multiple target domains of the same genre. The sentiment lexicon learning process is based on the latent semantic analysis technique and uses unlabeled training data from the source and target domains. The experiment was performed using 24 domains of the same genre, i.e., consumer product review. The proposed model displays the best results using standard evaluation measures compared with the competitive baselines. The proposed genre-based unsupervised approach achieves a maximum accuracy of 86% and outperforms methods recently presented in the literature.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.