Sentiment Analysis of Short Informal Texts

Kiritchenko, Svetlana; Zhu, Xiaodan; Mohammad, Saif M.

doi:10.1613/jair.4272

Cited by 737 publications

(469 citation statements)

References 39 publications

Supporting

Mentioning

443

Contrasting

Order By: Relevance

“…The PSTN model, which takes into account the human-annotated prior sentiment of arguments, performs the best. This could suggest that additional external knowledge, e.g., that from human-built resources or automatically learned from other data (e.g., as in (Kiritchenko et al, 2014)), including sentiment that cannot be inferred from its constituent expressions, might be incorporated to benefit the current neural-network-based models as prior knowledge. Note that the two neural network based models incorporate the syntax and semantics by representing each node with a vector.…”

Section: Resultsmentioning

confidence: 99%

An Empirical Study on the Effect of Negation Words on Sentiment

Zhu¹,

Guo²,

Mohammad³

et al. 2014

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

View full text Add to dashboard Cite

Negation words, such as no and not, play a fundamental role in modifying sentiment of textual expressions. We will refer to a negation word as the negator and the text span within the scope of the negator as the argument. Commonly used heuristics to estimate the sentiment of negated expressions rely simply on the sentiment of argument (and not on the negator or the argument itself). We use a sentiment treebank to show that these existing heuristics are poor estimators of sentiment. We then modify these heuristics to be dependent on the negators and show that this improves prediction. Next, we evaluate a recently proposed composition model (Socher et al., 2013) that relies on both the negator and the argument. This model learns the syntax and semantics of the negator's argument with a recursive neural network. We show that this approach performs better than those mentioned above. In addition, we explicitly incorporate the prior sentiment of the argument and observe that this information can help reduce fitting errors.

show abstract

Section: Resultsmentioning

confidence: 99%

An Empirical Study on the Effect of Negation Words on Sentiment

Zhu¹,

Guo²,

Mohammad³

et al. 2014

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

View full text Add to dashboard Cite

show abstract

“…The popularity of Twitter as a social media platform on which people can readily express their thoughts, feelings, and opinions, coupled with the openness of the platform, provides a large amount of publicly accessible data ripe for analysis, being a well established domain for sentiment analysis as reflecting realworld attitudes (Pak and Paroubek, 2010;Bollen et al, 2011). In this paper, we look into Twitter sentiment analysis (TSA) as a suitable, core instance of general short-text sentiment analysis (Thelwall et al, 2010(Thelwall et al, , 2012Kiritchenko et al, 2014;Dos Santos and Gatti, 2014), and encourage the methods and practices presented to be applied across other domains. Building a TSA model that can automatically determine the sentiment of a tweet has received significant attention over the past several years.…”

Section: Tweet Text + -0mentioning

confidence: 99%

Sentiment Analysis: It’s Complicated!

Kenyon-Dean

Ahmed²,

Fujimoto

et al. 2018

Proceedings of the 2018 Conference of the North American Chapter Of the Association for Computational Linguistics: Hu

View full text Add to dashboard Cite

Sentiment analysis is used as a proxy to measure human emotion, where the objective is to categorize text according to some predefined notion of sentiment. Sentiment analysis datasets are typically constructed with gold-standard sentiment labels, assigned based on the results of manual annotations. When working with such annotations, it is common for dataset constructors to discard "noisy" or "controversial" data where there is significant disagreement on the proper label. In datasets constructed for the purpose of Twitter sentiment analysis (TSA), these controversial examples can compose over 30% of the originally annotated data. We argue that the removal of such data is a problematic trend because, when performing real-time sentiment classification of short-text, an automated system cannot know a priori which samples would fall into this category of disputed sentiment. We therefore propose the notion of a "complicated" class of sentiment to categorize such text, and argue that its inclusion in the short-text sentiment analysis framework will improve the quality of automated sentiment analysis systems as they are implemented in real-world settings. We motivate this argument by building and analyzing a new publicly available TSA dataset of over 7,000 tweets annotated with 5x coverage, named MTSA. Our analysis of classifier performance over our dataset offers insights into sentiment analysis dataset and model design, how current techniques would perform in the real world, and how researchers should handle difficult data.

show abstract

“…There has been considerable research focusing on sentiment analysis of short texts (Thelwall et al, 2010;Kiritchenko et al, 2014), especially within recent SemEval campaigns (Nakov et al, 2016;Rosenthal et al, 2015Rosenthal et al, , 2014. A large body of recent work focuses on sentence-level sentiment prediction.…”

Section: Related Workmentioning

confidence: 99%

TakeLab at SemEval-2017 Task 5: Linear aggregation of word embeddings for fine-grained sentiment analysis of financial news

Rotim¹,

Tutek

Šnajder

2017

Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)

View full text Add to dashboard Cite

This paper describes our system for finegrained sentiment scoring of news headlines submitted to SemEval 2017 task 5, subtask 2. Our system uses a feature-light method that consists of a Support Vector Regression (SVR) with various kernels and word embedding vectors as features. Our best-performing submission scored 3rd on the task out of 29 teams and 4th out of 45 submissions, with a cosine score of 0.733.

show abstract

Sentiment Analysis of Short Informal Texts

Cited by 737 publications

References 39 publications

An Empirical Study on the Effect of Negation Words on Sentiment

An Empirical Study on the Effect of Negation Words on Sentiment

Sentiment Analysis: It’s Complicated!

TakeLab at SemEval-2017 Task 5: Linear aggregation of word embeddings for fine-grained sentiment analysis of financial news

Contact Info

Product

Resources

About