2017
DOI: 10.3390/a10010034
|View full text |Cite
|
Sign up to set email alerts
|

A Novel, Gradient Boosting Framework for Sentiment Analysis in Languages where NLP Resources Are Not Plentiful: A Case Study for Modern Greek

Abstract: Sentiment analysis has played a primary role in text classification. It is an undoubted fact that some years ago, textual information was spreading in manageable rates; however, nowadays, such information has overcome even the most ambiguous expectations and constantly grows within seconds. It is therefore quite complex to cope with the vast amount of textual data particularly if we also take the incremental production speed into account. Social media, e-commerce, news articles, comments and opinions are broad… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 40 publications
(23 citation statements)
references
References 32 publications
0
21
0
Order By: Relevance
“…It is a common phenomenon to have data imbalance on sentiment datasets collected through OSNs [5]. This is the case with our DFSMD dataset.…”
Section: Introductionmentioning
confidence: 92%
“…It is a common phenomenon to have data imbalance on sentiment datasets collected through OSNs [5]. This is the case with our DFSMD dataset.…”
Section: Introductionmentioning
confidence: 92%
“…Learning rate and number of trees are higher which leads to better performance but processing time also increased. Boosted decision tree was also used for sentiment analysis of Greek language which efficiently coped with both high dimensional and imbalanced datasets and achieves considerably enhanced then other traditional machine learning methods [61] as well as utilized for cardiovascular risk prediction [62] and risk prediction for inflammatory bowel disease [63]. Due to some limitations, decision forest was not given better results.…”
Section: Resultsmentioning
confidence: 99%
“…Cheng et al [139], [140], [141], [39], [142] addressed all problems related to the unlabelled data (i.e., unsupervised approach) and investigated the role played by the nonlexical and lexical features for testing and analysing the SA in various languages (like Turkish, Greek and English). The researchers used a dataset comprising of reviews and comments collected from social media networks like Facebook and Twitter.…”
Section: Future Research Directionmentioning
confidence: 99%