2011
DOI: 10.1007/978-94-007-1757-2_9
|View full text |Cite
|
Sign up to set email alerts
|

Sentiment Analysis Using Automatically Labelled Financial News Items

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2015
2015
2019
2019

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 4 publications
0
4
0
Order By: Relevance
“…An n-gram model is a sequence of 𝑛 consecutive symbols that can be characters, words, bytes, or any other continuous symbols. For instance, a 1-gram (or unigram) is one-symbol, a 2-symbol (bigram) is a two-symbol sequence of symbols, and 3-gram (trigram) is a three-symbol sequence of symbols so on and so forth (Généreux et al, 2011;Zhai et al, 2011). It uses the previous 𝑛 − 1 symbol to predict the next 𝑝(𝑡 𝑛 |𝑡 𝑛−1 ).…”
Section: N-gram Modelmentioning
confidence: 99%
See 2 more Smart Citations
“…An n-gram model is a sequence of 𝑛 consecutive symbols that can be characters, words, bytes, or any other continuous symbols. For instance, a 1-gram (or unigram) is one-symbol, a 2-symbol (bigram) is a two-symbol sequence of symbols, and 3-gram (trigram) is a three-symbol sequence of symbols so on and so forth (Généreux et al, 2011;Zhai et al, 2011). It uses the previous 𝑛 − 1 symbol to predict the next 𝑝(𝑡 𝑛 |𝑡 𝑛−1 ).…”
Section: N-gram Modelmentioning
confidence: 99%
“…Most of the current studies (Koppel and Shtrimberg, 2006;Groth and Muntermann, 2011;Yu et al, 2013) on sentiment-based financial news analysis typically rely on simple frequency-based textual presentations, such as Bag-of-Words (BoW) in which each piece of news is represented by the occurrence frequencies of distinct words. Some other research works (Généreux et al, 2011;Zhai et al, 2011), in contrast, have employed unigram text characterization, which shows similar characteristics to the BoW approach thanks to their commonality in consideration of occurrence frequency. However, in coping with a large volume of data, the process is encountered with a lot of the low frequency bigrams which can be considered as an informative and sentiment feature while the extraction of words is based on their high frequency which leads to ignore low frequency-based linguist features that can be worth to sentiment classification [9].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In a similar vein, Généreux et al (2011) investigate the impact of financial news items on the stock price of companies. They treat short financial news snippets about companies as if they were carrying implicit sentiment about the future market direction made explicit by the vocabulary they employ.…”
Section: Related Workmentioning
confidence: 99%