Unveiling Deception in Arabic: Optimization of Deceptive Text Detection Across Formal and Informal Genres

Alhayan, Fatimah; Himdi, Hanen T.; Alharbi, Basma

doi:10.1109/access.2024.3424531

IEEE Access

2024

DOI: 10.1109/access.2024.3424531

|View full text |Cite

Unveiling Deception in Arabic: Optimization of Deceptive Text Detection Across Formal and Informal Genres

Fatimah Alhayan,

Hanen T. Himdi,

Basma Alharbi

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

Publication Types

Select...

Article1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

References 52 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

VERA-ARAB: unveiling the Arabic tweets credibility by constructing balanced news dataset for veracity analysis

Mostafa,

Almogren

2024

PeerJ Computer Science

View full text Add to dashboard Cite

The proliferation of fake news on social media platforms necessitates the development of reliable datasets for effective fake news detection and veracity analysis. In this article, we introduce a veracity dataset of Arabic tweets called “VERA-ARAB”, a pioneering large-scale dataset designed to enhance fake news detection in Arabic tweets. VERA-ARAB is a balanced, multi-domain, and multi-dialectal dataset, containing both fake and true news, meticulously verified by fact-checking experts from Misbar. Comprising approximately 20,000 tweets from 13,000 distinct users and covering 884 different claims, the dataset includes detailed information such as news text, user details, and spatiotemporal data, spanning diverse domains like sports and politics. We leveraged the X API to retrieve and structure the dataset, providing a comprehensive data dictionary to describe the raw data and conducting a thorough statistical descriptive analysis. This analysis reveals insightful patterns and distributions, visualized according to data type and nature. We also evaluated the dataset using multiple machine learning classification models, exploring various social and textual features. Our findings indicate promising results, particularly with textual features, underscoring the dataset’s potential for enhancing fake news detection. Furthermore, we outline future work aimed at expanding VERA-ARAB to establish it as a benchmark for Arabic tweets in fake news detection. We also discuss other potential applications that could leverage the VERA-ARAB dataset, emphasizing its value and versatility for advancing the field of fake news detection in Arabic social media. Potential applications include user veracity assessment, topic modeling, and named entity recognition, demonstrating the dataset's wide-ranging utility for broader research in information quality management on social media.

show abstract

VERA-ARAB: unveiling the Arabic tweets credibility by constructing balanced news dataset for veracity analysis

Mostafa,

Almogren

2024

PeerJ Computer Science

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Unveiling Deception in Arabic: Optimization of Deceptive Text Detection Across Formal and Informal Genres

Cited by 1 publication

References 52 publications

VERA-ARAB: unveiling the Arabic tweets credibility by constructing balanced news dataset for veracity analysis

VERA-ARAB: unveiling the Arabic tweets credibility by constructing balanced news dataset for veracity analysis

Contact Info

Product

Resources

About