2022
DOI: 10.1007/s13278-022-00906-8
|View full text |Cite
|
Sign up to set email alerts
|

Predicting the type and target of offensive social media posts in Marathi

Abstract: The presence of offensive language on social media is very common motivating platforms to invest in strategies to make communities safer. This includes developing robust machine learning systems capable of recognizing offensive content online. Apart from a few notable exceptions, most research on automatic offensive language identification has dealt with English and a few other high resource languages such as French, German, and Spanish. In this paper we address this gap by tackling offensive language identifi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 12 publications
(1 citation statement)
references
References 38 publications
0
1
0
Order By: Relevance
“…We adopt the guidelines introduced by the popular OLID annotation taxonomy (Zampieri et al, 2019a) used in the OffensEval shared task (Zampieri et al, 2019b) and replicated in multiple other datasets in languages such as Danish (Sigurbergsson and Derczynski, 2020), Greek (Pitenis et al, 2020), Marathi (Gaikwad et al, 2021;Zampieri et al, 2022), Portuguese (Sigurbergsson and Derczynski, 2020), Sinhala (Ranasinghe et al, 2022) and Turkish (Çöltekin, 2020). We choose OLID due to the flexibility provided by its threelevel hierarchical taxonomy that allows us to model different types of offensive and abusive content (e.g., hate speech, cyberbulling, etc.)…”
Section: Datamentioning
confidence: 99%
“…We adopt the guidelines introduced by the popular OLID annotation taxonomy (Zampieri et al, 2019a) used in the OffensEval shared task (Zampieri et al, 2019b) and replicated in multiple other datasets in languages such as Danish (Sigurbergsson and Derczynski, 2020), Greek (Pitenis et al, 2020), Marathi (Gaikwad et al, 2021;Zampieri et al, 2022), Portuguese (Sigurbergsson and Derczynski, 2020), Sinhala (Ranasinghe et al, 2022) and Turkish (Çöltekin, 2020). We choose OLID due to the flexibility provided by its threelevel hierarchical taxonomy that allows us to model different types of offensive and abusive content (e.g., hate speech, cyberbulling, etc.)…”
Section: Datamentioning
confidence: 99%