Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) 2014
DOI: 10.3115/v1/p14-2071
|View full text |Cite
|
Sign up to set email alerts
|

Improving Twitter Sentiment Analysis with Topic-Based Mixture Modeling and Semi-Supervised Training

Abstract: In this paper, we present multiple approaches to improve sentiment analysis on Twitter data. We first establish a state-of-the-art baseline with a rich feature set. Then we build a topic-based sentiment mixture model with topic-specific data in a semi-supervised training framework. The topic information is generated through topic modeling based on an efficient implementation of Latent Dirichlet Allocation (LDA). The proposed sentiment model outperforms the top system in the task of Sentiment Analysis in Twitte… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
47
0
3

Year Published

2015
2015
2023
2023

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 71 publications
(50 citation statements)
references
References 13 publications
0
47
0
3
Order By: Relevance
“…So the training and testing dataset is downloaded from http://www.cs.uic.edu/~liub/FBS/sentiment-analysis.html, training dataset consist of total 6828 positive and negative words. Testing data set contain free text reviews written by user towards particular product [8]. Figure2 shows system architecture which is divided into two phase .Training phase and testing phase.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…So the training and testing dataset is downloaded from http://www.cs.uic.edu/~liub/FBS/sentiment-analysis.html, training dataset consist of total 6828 positive and negative words. Testing data set contain free text reviews written by user towards particular product [8]. Figure2 shows system architecture which is divided into two phase .Training phase and testing phase.…”
Section: Methodsmentioning
confidence: 99%
“…In this system using Machine learning approach which is divided into supervised ,unsupervised and semi supervised methods [8].Here supervised classification is being implemented which requires training and testing dataset. So the training and testing dataset is downloaded from http://www.cs.uic.edu/~liub/FBS/sentiment-analysis.html, training dataset consist of total 6828 positive and negative words.…”
Section: Methodsmentioning
confidence: 99%
“…Characterizing an attitude as positive, negative or neutral toward a topic is known as sentiment analysis. Most of the contribution in the field focuses on finding sentiments in the tweet level [1,5,13,25], some of them suggest aggregating the sentiments as a simple sum [4,7,17,22,24], while the problem of the reputation of an entity has not been specifically addressed. Natural language processing is a well-established research area in computer science.…”
Section: Related Workmentioning
confidence: 99%
“…Machine learning techniques prove to be effective with sentiment analysis: a semi-supervised approach that uses an interpolation between a universal labeled training set as a base, processed with SVM, and then a topic-related unlabeled training set for enrichment, processed with LDA in [25] and naive Bayes that uses topic-related clusters in [22]. In order to augment the accuracy of the classifier, semantic sentiment analysis is used in [19].…”
Section: Related Workmentioning
confidence: 99%
“…The paper by Tang et al [27] shows a joint sentence-level segmentation and classification system. Latent Dirichlet Allocation (LDA) was used by Xiang and Zhou [28] in the creation of topic-specific information, before going on to divide the data into several subsets based on topic distribution. In the last wave, they presented a semi-supervised training system to further increase classification accuracy.…”
Section: A Sentiment Analysis In Genaralmentioning
confidence: 99%