2023
DOI: 10.1145/3603399
|View full text |Cite
|
Sign up to set email alerts
|

Detecting Harmful Content on Online Platforms: What Platforms Need vs. Where Research Efforts Go

Abstract: The proliferation of harmful content on online platforms is a major societal problem, which comes in many different forms including hate speech, offensive language, bullying and harassment, misinformation, spam, violence, graphic content, sexual abuse, self harm, and many other. Online platforms seek to moderate such content to limit societal harm, to comply with legislation, and to create a more inclusive environment for their users. Researchers have developed different methods for automatically detecting har… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 20 publications
(6 citation statements)
references
References 78 publications
0
6
0
Order By: Relevance
“…Strategies and resources have also been put forward for identifying threatening content in low-resource languages [63,64]. Additionally, comprehensive surveys on threat detection techniques and moderation policies on tackling such content by online platforms have been conducted [65,66]. Many languages still lack sufficient linguistic resources for NLP-related tasks [67].…”
Section: Downstream Tasks In Hausa Languagementioning
confidence: 99%
“…Strategies and resources have also been put forward for identifying threatening content in low-resource languages [63,64]. Additionally, comprehensive surveys on threat detection techniques and moderation policies on tackling such content by online platforms have been conducted [65,66]. Many languages still lack sufficient linguistic resources for NLP-related tasks [67].…”
Section: Downstream Tasks In Hausa Languagementioning
confidence: 99%
“…7 Specifically, on May 23rd we queried Twitter for all the accounts that shared a tweet in UK-RU and FR-22, obtaining almost 2M users that were suspended by the platform for violating their rules. Twitter might suspend an account in a variety of circumstances that range from promoting violence and glorifying crime to hate speech, spam, and impersonation; similarly to other Big Tech platforms, these guidelines are considered among the most stringent [ 61 ]. More details about reasons for suspension are available in the Twitter documentation.…”
Section: Data Collectionmentioning
confidence: 99%
“…While there are several surveys on toxic language detection (Arora et al, 2023; Balayn et al, 2021; Chhabra & Kumar, 2023; Fortuna & Nunes, 2019;Poletto et al, 2021; Vidgen & Derczynski, 2020), with some of them focusing on Arabic language (ALBayari et al, 2021; Al‐Hassan & Al‐Dossari, 2019; Alsunaidi et al, 2023; Elzayady et al, 2022; Husain & Uzuner, 2021; Khairy et al, 2021), no initiative has yet focused on the Arabic datasets in this domain. Previous seminal initiatives focusing on reviewing toxic language resources (Poletto et al, 2021; Vidgen & Derczynski, 2020) are not language‐specific and referenced only a limited number of Arabic datasets.…”
Section: Introductionmentioning
confidence: 99%