2021
DOI: 10.1145/3421504
|View full text |Cite
|
Sign up to set email alerts
|

A Survey of Offensive Language Detection for the Arabic Language

Abstract: The use of offensive language in user-generated content is a serious problem that needs to be addressed with the latest technology. The field of Natural Language Processing (NLP) can support the automatic detection of offensive language. In this survey, we review previous NLP studies that cover Arabic offensive language detection. This survey investigates the state-of-the-art in offensive language detection for the Arabic language, providing a structured overview of previous approaches, including core techniqu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
29
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 38 publications
(29 citation statements)
references
References 28 publications
0
29
0
Order By: Relevance
“…Actually, personal knowledge and culture are both crucial elements in defining what is offensive and what is not. A word used in a written or spoken communication is offensive if it contains conduct meant to cause hurt, pain, or anger [10], [16]. Hate speech and cyberbullying are two significant forms of offensive language, their prevalence on social media has recently risen.…”
Section: Background 21 Offensive Languagementioning
confidence: 99%
See 2 more Smart Citations
“…Actually, personal knowledge and culture are both crucial elements in defining what is offensive and what is not. A word used in a written or spoken communication is offensive if it contains conduct meant to cause hurt, pain, or anger [10], [16]. Hate speech and cyberbullying are two significant forms of offensive language, their prevalence on social media has recently risen.…”
Section: Background 21 Offensive Languagementioning
confidence: 99%
“…It is described as "any use of modern digital technology to propagate racial, religious, extremist, or terrorist ideas" [18]. Hate speech can be classified into the following categories: gendered and religious [10], [19].…”
Section: Hate Speechmentioning
confidence: 99%
See 1 more Smart Citation
“…Binary SVM classifiers have employed textual content as feature vectors by implementing feature selection on each word (uni-grams). More specifically, they applied the method dictated by the bag-of-words language model 32 , and every individual is considered as a single word feature. Each feature cumulatively forms the data set, which represents the corpus consisting of a document per column matrix.…”
Section: Marketing and Business Analysismentioning
confidence: 99%
“…Therefore, linguistic idiom is a feature of Greek social text that needs to be studied separately. Nevertheless, researchers have studied the reaction patterns of social media users in the language of their interest, such as Czech [31], Arabic [32], as well as in different languages [33] on various applications, including sentiment analysis, regarding either a single language or a multilingual setting [19,20,34].…”
Section: Introductionmentioning
confidence: 99%