2020
DOI: 10.22266/ijies2020.0229.27
|View full text |Cite
|
Sign up to set email alerts
|

Credibility Detection in Twitter Using Word N-gram Analysis and Supervised Machine Learning Techniques

Abstract: With the evolution of social media platforms, the Internet is used as a source for obtaining news about current events. Recently, Twitter has become one of the most popular social media platforms that allows public users to share the news. The platform is growing rapidly especially among young people who may be influenced by the information from anonymous sources. Therefore, predicting the credibility of news in Twitter becomes a necessity especially in the case of emergencies. This paper introduces a classifi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
20
0
4

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 36 publications
(24 citation statements)
references
References 21 publications
0
20
0
4
Order By: Relevance
“…We examined frequent word pairs (bigrams) for each facet to qualitatively validate that tweets were related to the facet they were assigned to as has been done in other Twitter-based studies as they offer more insight into the sentiment of tweets than examining unigrams on their own [18] , [19] , [20] . A representative sample is shown in Appendix B.…”
Section: Methodsmentioning
confidence: 99%
“…We examined frequent word pairs (bigrams) for each facet to qualitatively validate that tweets were related to the facet they were assigned to as has been done in other Twitter-based studies as they offer more insight into the sentiment of tweets than examining unigrams on their own [18] , [19] , [20] . A representative sample is shown in Appendix B.…”
Section: Methodsmentioning
confidence: 99%
“…In this section, we compare the performance of the proposed model with three models existing in the literature. The first model was introduced by Noha Hassan et al [42] who introduces a classification model based on supervised machine learning techniques and word-based N-gram analysis to automatically classify Twitter messages into credible and non-credible. The results in Table 5 show that this model has an accuracy of 84.9%.…”
Section: Comparison (Vssyntax-based Methods)mentioning
confidence: 99%
“…in Facebook also , Fong et al [35] have taken into consideration the analysis of the avatar on a profile, sex, age, and the name. Noha et al [42], introduces a classification model based on supervised machine learning techniques and word-based N-gram analysis to classify Twitter messages automatically into credible and not credible. The best performance is achieved using a combination of both unigrams and bigrams, LSVM as a classifier and TF-IDF as a feature extraction technique.…”
Section: Related Workmentioning
confidence: 99%
“…Using a Kaggle data collection, the authors investigated to label each of the tweets as positive, negative or neutral sentiment. A classification algorithm focused on supervised ML techniques and word-based N-gram processing to automatically divide Twitter messages into credible and not credible ones introduced by (Hassan et al, 2020). Five different supervised ML classification techniques were applied and the research examines two interpretations of features (TF and TF-IDF) and separate sets of N-gram terms.…”
Section: Literature Reviewmentioning
confidence: 99%