2021
DOI: 10.1007/s10579-021-09555-3
|View full text |Cite
|
Sign up to set email alerts
|

A multi-source entity-level sentiment corpus for the financial domain: the FinLin corpus

Abstract: We introduce FinLin, a novel corpus containing investor reports, company reports, news articles, and microblogs from StockTwits, targeting multiple entities stemming from the automobile industry and covering a 3-month period. FinLin was annotated with a sentiment score and a relevance score in the range [− 1.0, 1.0] and [0.0, 1.0], respectively. The annotations also include the text spans selected for the sentiment, thus, providing additional insight into the annotators’ reasoning. Overall, FinLin aims to comp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 34 publications
0
4
0
Order By: Relevance
“…• FinLin [3]: Investor reports, company reports, news articles, and microblogs from multiple sources annotated using a two-dimensional scale of sentiment score and relevance score by three annotators with specific profiles. The annotations are consolidated either automatically or manually by an additional annotator using the same tool (AWOCATo).…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…• FinLin [3]: Investor reports, company reports, news articles, and microblogs from multiple sources annotated using a two-dimensional scale of sentiment score and relevance score by three annotators with specific profiles. The annotations are consolidated either automatically or manually by an additional annotator using the same tool (AWOCATo).…”
Section: Related Workmentioning
confidence: 99%
“…The study found evidence for the supremacy of the reader's perspective in terms of Inter-Annotator Agreement (IAA) and rating intensity, and achieved near-human performance when mapping between dimensional and categorical formats. Other works are [8], [9], [10], [11], [12], [13], [14], [15]. Some of the benefits of these methodologies are: They cover a variety of domains and topics, such as movie reviews, tweets, financial texts, news articles, and video data; they use different annotation schemes and tools to capture different aspects of sentiment, such as polarity, intensity, relevance, event type, sector, asset type, and transaction anonymity.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations