2017
DOI: 10.1177/0165551516684296
|View full text |Cite
|
Sign up to set email alerts
|

Feature engineering for detecting spammers on Twitter: Modelling and analysis

Abstract: Twitter is a social networking website that has gained a lot of popularity around the world in the last decade. This popularity made Twitter a common target for spammers and malicious users to spread unwanted advertisements, viruses and phishing attacks. In this article, we review the latest research works to determine the most effective features that were investigated for spam detection in the literature. These features are collected to build a comprehensive data set that can be used to develop more robust an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
30
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 46 publications
(30 citation statements)
references
References 19 publications
0
30
0
Order By: Relevance
“…Both regulations and technical solutions are necessary to stop spammers and retain the users’ trust in their platforms. As spam diminishes the productivity, trustworthiness and performance of the OSNs, causing to expose the users to different security threats, it is considered very important to mitigate or to totally eliminate spam profiles [18]. In our previous work [19], the analysis was based on the algorithm itself, unlike this work which focuses more on the analytical side of the features.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Both regulations and technical solutions are necessary to stop spammers and retain the users’ trust in their platforms. As spam diminishes the productivity, trustworthiness and performance of the OSNs, causing to expose the users to different security threats, it is considered very important to mitigate or to totally eliminate spam profiles [18]. In our previous work [19], the analysis was based on the algorithm itself, unlike this work which focuses more on the analytical side of the features.…”
Section: Introductionmentioning
confidence: 99%
“…This indicates that the German users are content-based, while the Indonesian and Malay tend to be more social and message broadcasting–based. A growing body of research is targeting social spam with the goal to improve spam detection methods in the context of OSNs [18,25]. However, there is less focus in the literature on understanding the social spam in different lingual contexts in OSNs.…”
Section: Introductionmentioning
confidence: 99%
“…", Whether contains the mark "!" [3]. For each news item in the training set, we extract its linguistic features and train an XGBoost [1] classifier using these features.…”
Section: Perturbation By Replacementmentioning
confidence: 99%
“…Therefore, the identification technology of spammers has been extensively studied. According to different areas, the current research areas on spammer identification technologies can be divided into the e-mail field, the field of social networks, the field of news media, the e-commerce field [20]- [22]. According to different research methods, it can be divided into spammer identification technologies based on user behavioral features, semantic features, and environmental features [28], [29].…”
Section: Related Workmentioning
confidence: 99%