2020
DOI: 10.3390/info11070341

Real-Time Tweet Analytics Using Hybrid Hashtags on Twitter Big Data Streams

Abstract: Twitter is a microblogging platform that generates large volumes of data with high velocity. This daily generation of unbounded and continuous data leads to Big Data streams that often require real-time distributed and fully automated processing. Hashtags, hyperlinked words in tweets, are widely used for tweet topic classification, retrieval, and clustering. Hashtags are used widely for analyzing tweet sentiments where emotions can be classified without contexts. However, regardless of the wide usage of hashta…

Cited by 18 publications (14 citation statements)
References 46 publications
“…Figure 8 shows the overall throughput for the three datasets; the y-axis shows throughput percentage and the x-axis shows the number of triads in percentage. The throughput (T) is measured using (5). We used a relative throughput measure to cross-relate the results.…”
Section: Results (mentioning)
confidence: 99%
“…These include refinement of recommendation systems, fake user identification, analysis of micro blogging, detection of natural disasters using real-time Twitter Big Data, business decision making, and healthcare systems [1][2][3][4][5]. Companies and businesses increase revenues and improve goodwill by maintaining their micro blogging systems.…”
Section: Introduction (mentioning)
confidence: 99%
“…Annotating data is labor-intensive and several solutions have been proposed to reduce the coding work to a minimum; such as employing labeled data from a related task but different corpus, or using hashtags or well-defined keywords as annotations instead of human codings (Hasan et al., 2014; Gupta & Hewett, 2020). Next to that, semi-supervised learning (Van Engelen & Hoos, 2020) and transfer learning (Terechshenko et al., 2020) can be relevant to train a classifier when labeled data is scarce.…”
Section: Introduction (mentioning)
confidence: 99%
“…Twitter is one of the most accessed social media, where mass people express their lifestyle and thoughts. The daily count of tweets reaches the 500 million mark regularly, which involves texts, images, and videos [6]. This high volume and variety of data makes tweets as a candidate for unstructured big data source.…”
Section: Introduction (mentioning)
confidence: 99%