Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop 2019
DOI: 10.18653/v1/p19-2051
|View full text |Cite
|
Sign up to set email alerts
|

Cross-domain and Cross-lingual Abusive Language Detection: A Hybrid Approach with Deep Learning and a Multilingual Lexicon

Abstract: The development of computational methods to detect abusive language in social media within variable and multilingual contexts has recently gained significant traction. The growing interest is confirmed by the large number of benchmark corpora for different languages developed in the latest years. However, abusive language behaviour is multifaceted and available datasets are featured by different topical focuses. This makes abusive language detection a domain-dependent task, and building a robust system to dete… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
92
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
2

Relationship

4
5

Authors

Journals

citations
Cited by 68 publications
(93 citation statements)
references
References 22 publications
1
92
0
Order By: Relevance
“…For example, our survey captures a great availability of benchmark datasets for the evaluation of abusive language and hate speech detection systems, in several languages and with several topical focuses. This adds to the challenge of investigating architectures which are stable and well-performing across different languages and abusive domains, making it a more and more promising topic to research (Corazza et al 2020;Pamungkas and Patti 2019;Ousidhoum et al 2019).…”
Section: Lexical Analysismentioning
confidence: 99%
“…For example, our survey captures a great availability of benchmark datasets for the evaluation of abusive language and hate speech detection systems, in several languages and with several topical focuses. This adds to the challenge of investigating architectures which are stable and well-performing across different languages and abusive domains, making it a more and more promising topic to research (Corazza et al 2020;Pamungkas and Patti 2019;Ousidhoum et al 2019).…”
Section: Lexical Analysismentioning
confidence: 99%
“…This evaluation gap is being bridged recently by evaluation campaigns for English, Spanish (SemEval [10]), German [11], and Italian (EVALITA [12]), whose shared tasks released annotated datasets for hate speech detection. The availability of benchmarks for system evaluation and datasets for hate speech detection in different languages made the challenge of investigating architectures, which are also stable and well-performing across different languages, an exciting issue to research [13,14].…”
Section: Related Workmentioning
confidence: 99%
“…Detecting abusive language for less-resourced languages is complex, and has inspired research in multilingual and cross-lingual methods [16]. These methods are especially useful when the involved languages are morphologically or geographically close [18]. In our work, we investigate hate speech detection methods for English, Croatian, and Slovene languages.…”
Section: Hate Speech Detectionmentioning
confidence: 99%