2019
DOI: 10.1002/spe.2724
|View full text |Cite
|
Sign up to set email alerts
|

A spark‐based big data analysis framework for real‐time sentiment prediction on streaming data

Abstract: Summary There are many data sources that produce large volumes of data. The Big Data nature requires new distributed processing approaches to extract the valuable information. Real‐time sentiment analysis is one of the most demanding research areas that requires powerful Big Data analytics tools such as Spark. Prior literature survey work has shown that, though there are many conventional sentiment analysis researches, there are only few works realizing sentiment analysis in real time. One major point that aff… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 14 publications
(5 citation statements)
references
References 32 publications
0
4
0
Order By: Relevance
“…A number of studies have combined two or more approaches for TSA such as combining machine-learning and lexicon-based approaches [3]. The work in [26] proposed a hybrid method by discussing a real-time sentiment analysis using Apache Spark's machine learning library, Hadoop distributed file system and streaming engine for sentiment prediction. The sentiment classification performance of the proposed system for offline and real-time modes were 86.77% and 80.93%, respectively.…”
Section: Hybrid-based Approachesmentioning
confidence: 99%
“…A number of studies have combined two or more approaches for TSA such as combining machine-learning and lexicon-based approaches [3]. The work in [26] proposed a hybrid method by discussing a real-time sentiment analysis using Apache Spark's machine learning library, Hadoop distributed file system and streaming engine for sentiment prediction. The sentiment classification performance of the proposed system for offline and real-time modes were 86.77% and 80.93%, respectively.…”
Section: Hybrid-based Approachesmentioning
confidence: 99%
“…Kılınç [23] has claimed that data produced by fake personalities affects the sentiment analysis model. In order to overcome this, a spark based big data framework, which incorporates a sentiment analysis method and a fake detection method has been proposed.…”
Section: Related Workmentioning
confidence: 99%
“…There are also some papers focusing on real-time sentiment analysis. Kilinc demonstrates that a considerable challenge of real-time sentiment analysis is the uncertainty of the reliability since there exist some fake accounts due to unethical reasons, he builds a spark-based real-time sentiment prediction framework that detects the authenticity of accounts before inputting data [7]. Moreover, SVM shows an accuracy of around 85% on real-time sentiment analysis of Amazon product reviews, there are four main steps in processing data, which are tokenization, removing stop words, POS tagging, and stemming [8].…”
Section: Literature Reviewmentioning
confidence: 99%