2021
DOI: 10.25046/aj060187
|View full text |Cite
|
Sign up to set email alerts
|

Text Mining Techniques for Cyberbullying Detection: State of the Art

Abstract: The dramatic growth of social media during the last years has been associated with the emergence of a new bullying types. Platforms such as Facebook, Twitter, YouTube, and others are now privileged ways to disseminate all kinds of information. Indeed, communicating through social media without revealing the real identity has emerged an ideal atmosphere for cyberbullying, where people can pour out their hatred. Therefore, become very urgent to find automated methods to detect cyberbullying through text mining t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(12 citation statements)
references
References 25 publications
0
12
0
Order By: Relevance
“…Thus, the need for an existing Arabic dataset to evaluate automatic algorithms for detecting and classifying offensive speech is critical. Indeed, the majority of the available cyberbullying datasets are in English [19][20][21], while there are only a few available in Arabic. Hence, in this section, we provide an overview of the most recent Arabic datasets that addressed cyberbullying, abusive, and offensive language.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Thus, the need for an existing Arabic dataset to evaluate automatic algorithms for detecting and classifying offensive speech is critical. Indeed, the majority of the available cyberbullying datasets are in English [19][20][21], while there are only a few available in Arabic. Hence, in this section, we provide an overview of the most recent Arabic datasets that addressed cyberbullying, abusive, and offensive language.…”
Section: Related Workmentioning
confidence: 99%
“…Therefore, in our work, we introduce a corpus with (sub-class categorization (multi-class)) that could help in studying cognitive processes, such as investigating whether cyberbullying is linked with increased levels of anxiety in online audiences. .However, the majority of the available cyberbullying datasets are in English [19,20], with only a few available in Arabic, none of which are data collected from the Instagram platform [18,21]; the number of Arabic datasets available for offensive/hate speech/cyberbullying auto detection is limited compared to in the English language. Furthermore, none of them are collected from Instagram.…”
Section: Related Workmentioning
confidence: 99%
“…Social networking has become one of the most popular communication platforms nowadays. People of different ages, cultures, and socioeconomic classes utilize social network environments to communicate a variety of messages to a worldwide audience [1,2]. Twitter, Instagram, and Facebook are examples of social networking platforms that allow users to communicate and freely discuss their thoughts and opinions in a non-constrictive atmosphere.…”
Section: Introductionmentioning
confidence: 99%
“…It is worth noting that SPSS software restricted the following rules for correctly implementing Cohen's kappa coefficient: (1) The raters' responses were graded on a nominal scale, and the categories must be mutually exclusive (in our experiment: positive, negative, or neutral). (2) The response data were made up of paired observations of the same phenomena, which meant that both raters evaluated the identical observations (in our experiment, annotators evaluated the same comments, i.e., text). (3) The two raters were independent, which meant that one rater's decision did not influence the decision of the other (in our experiment, the annotators were independent).…”
mentioning
confidence: 99%
“…Social Media has become a part of daily life; It is increasingly hard to survive in this modern era of digital media without a digital footprint [1]. People are increasingly using social media platforms for various purposes and reasons, resulting in a vast amount of online data being created on a daily basis [2].…”
Section: Introductionmentioning
confidence: 99%