2023
DOI: 10.3390/electronics12122662
|View full text |Cite
|
Sign up to set email alerts
|

Policy-Based Spam Detection of Tweets Dataset

Abstract: Spam communications from spam ads and social media platforms such as Facebook, Twitter, and Instagram are increasing, making spam detection more popular. Many languages are used for spam review identification, including Chinese, Urdu, Roman Urdu, English, Turkish, etc.; however, there are fewer high-quality datasets available for Urdu. This is mainly because Urdu is less extensively used on social media networks such as Twitter, making it harder to collect huge volumes of relevant data. This paper investigates… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 32 publications
0
2
0
Order By: Relevance
“…The SL-based approach needs training data (i.e., the features of untruthful reviews) in order to construct a model, like a neural network. Studies [31,35] used manually labeled training sets to construct a model. By using the trained model in [31], the classifier could effectively identify brand-only and non-review spam; however, the precision rate of recognizing untruthful reviews was low.…”
Section: Review Spam Detectionmentioning
confidence: 99%
“…The SL-based approach needs training data (i.e., the features of untruthful reviews) in order to construct a model, like a neural network. Studies [31,35] used manually labeled training sets to construct a model. By using the trained model in [31], the classifier could effectively identify brand-only and non-review spam; however, the precision rate of recognizing untruthful reviews was low.…”
Section: Review Spam Detectionmentioning
confidence: 99%
“…Advancements in text representation techniques such as TF-IDF [9,10], word embeddings, and n-grams [11][12][13][14][15][16] have contributed to more robust feature representations of text data. These approaches help in capturing more nuanced information from messages, improving the classifier's ability to discern between spam and non-spam emails.…”
Section: Introductionmentioning
confidence: 99%