2022
DOI: 10.1109/access.2022.3147588
|View full text |Cite
|
Sign up to set email alerts
|

Cross-Lingual Few-Shot Hate Speech and Offensive Language Detection Using Meta Learning

Abstract: Automatic detection of abusive online content such as hate speech, offensive language, threats, etc. has become prevalent in social media, with multiple efforts dedicated to detecting this phenomenon in English. However, detecting hatred and abuse in low-resource languages is a non-trivial challenge. The lack of sufficient labeled data in low-resource languages and inconsistent generalization ability of transformer-based multilingual pre-trained language models for typologically diverse languages make these mo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
29
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 38 publications
(29 citation statements)
references
References 56 publications
0
29
0
Order By: Relevance
“…Among the diverse topics investigated, the predominant data reservoir for numerous studies has been derived from Twitter. Subsequently, Facebook has been another substantial source (Bilal et al, 2022; MacAvaney et al, 2019; Mozafari et al, 2022; Rodriguez et al, 2022; Sreelakshmi et al, 2020). Additionally, alternative platforms such as YouTube comments have been utilized (Kumar Roy et al, 2022; Roy et al, 2022; Sajid et al, 2020), along with resources like Wikipedia (Beddiar et al, 2021) and diverse online platforms (Alatawi et al, 2021; Beddiar et al, 2021).…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Among the diverse topics investigated, the predominant data reservoir for numerous studies has been derived from Twitter. Subsequently, Facebook has been another substantial source (Bilal et al, 2022; MacAvaney et al, 2019; Mozafari et al, 2022; Rodriguez et al, 2022; Sreelakshmi et al, 2020). Additionally, alternative platforms such as YouTube comments have been utilized (Kumar Roy et al, 2022; Roy et al, 2022; Sajid et al, 2020), along with resources like Wikipedia (Beddiar et al, 2021) and diverse online platforms (Alatawi et al, 2021; Beddiar et al, 2021).…”
Section: Discussionmentioning
confidence: 99%
“…Numerous research investigations have meticulously delved into the realm of identifying hate speech, with a notable emphasis on discerning hate speech within political contexts, as is evident from the works of (Oriola & Kotze, 2020; Ribeiro et al, 2018; Wang et al, 2022). Additionally, the academic discourse extends to the scrutiny of hate speech intertwined with religious themes, as underscored by (Al‐Hassan & Al‐Dossari, 2022; Ali et al, 2021; Ghosh et al, 2023; Mozafari et al, 2022; Qureshi & Sabih, 2021; Sajid et al, 2020). Visual representation of the distribution of these topics is offered in Figure 12 and Table 9, which convey the number of instances within each thematic domain.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…X-MAML explores various auxiliary languages to identify the optimal composition for zero-shot cross-lingual transfer. Meta-learning has also been applied in the detection of offensive language in cross-lingual and code-mixed texts [40], [41] and other harmful content such as multilingual rumours [42]. However, the limited availability of multilingual hate speech datasets, comprising of only two or three languages, presents a challenge in finding an effective auxiliary language.…”
Section: Related Work Multilingual Hate Speech Detectionmentioning
confidence: 99%
“…They have conducted experiments for a binary hate speech classification task in Multilingual-Train Monolingual-Test, Monolingual-Train Monolingual-Test, and Language-Family-Train Monolingual Test scenarios. Mozafari et al [13] investigated the feasibility of applying a meta-learning approach in cross-lingual few-shot hate speech detection by leveraging two meta-learning models based on optimization-based and metric-based (MAML and Proto-MAML) methods. These findings demonstrate the varying performance of different machine learning approaches in hate speech detection, depending on the language and dataset under consideration.…”
Section: Introductionmentioning
confidence: 99%