Forum for Information Retrieval Evaluation 2020
DOI: 10.1145/3441501.3441517
|View full text |Cite
|
Sign up to set email alerts
|

Overview of the HASOC Track at FIRE 2020: Hate Speech and Offensive Language Identification in Tamil, Malayalam, Hindi, English and German

Abstract: With the growth of social media, the spread of hate speech is also increasing rapidly. Social media are widely used in many countries. Also Hate Speech is spreading in these countries. This brings a need for multilingual Hate Speech detection algorithms. Much research in this area is dedicated to English at the moment. The HASOC track intends to provide a platform to develop and optimize Hate Speech detection algorithms for Hindi, German and English. The dataset is collected from a Twitter archive and pre-clas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
77
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 105 publications
(78 citation statements)
references
References 36 publications
1
77
0
Order By: Relevance
“…Initially, a set of words was used to collect tweets, and then some keywords that were not frequent in offensive content were excluded during the trail annotation. Similarly, for the dataset of the HASOC track [18], the data were acquired using hashtags and keywords with offensive content. Here, we aim to use a keyword-based technique to evaluate our keyword extraction method by analyzing how these keywords can influence the detection of offensive language.…”
Section: Abstract In Offensive Language Detectionmentioning
confidence: 99%
“…Initially, a set of words was used to collect tweets, and then some keywords that were not frequent in offensive content were excluded during the trail annotation. Similarly, for the dataset of the HASOC track [18], the data were acquired using hashtags and keywords with offensive content. Here, we aim to use a keyword-based technique to evaluate our keyword extraction method by analyzing how these keywords can influence the detection of offensive language.…”
Section: Abstract In Offensive Language Detectionmentioning
confidence: 99%
“…In terms of computational methods, recent work has employed deep neural models such as convolutional neural networks (CNNs) and long, short-term memory (LSTM). With the introduction of transformer-based models, most notably BERT [23], neural transformer models [24] have been widely applied in offensive language identification, topping the leaderboards of competitions such as HatEval [3], HASOC [25], OffensEval [2], and TRAC [18].…”
Section: Related Workmentioning
confidence: 99%
“…The HASOC shared task, which stands for "hate speech and offensive content identification", in Indo-European Languages is arguably the most well-known series of competitions including languages from India [25,31]. It has been organized in 2019 and 2020 at the Forum for Information Retrieval (FIRE).…”
Section: Offensive Language Identification In Languages From Indiamentioning
confidence: 99%
“…Prior work has either designed methods for identifying conversations that are likely to go awry (Zhang WARNING: This paper contains text excerpts and words that are offensive in nature. Chang et al, 2020) or detecting offensive content and labelling posts at the instances level -this has been the focus in the recent shared tasks like HASOC at FIRE 2019 (Mandl et al, 2019a) and FIRE 2020 (Mandl et al, 2020), Ger-mEval 2019 Task 2 (Struß et al, 2019), TRAC (Kumar et al, 2018, HatEval (Basile et al, 2019a), OffensEval at SemEval-2019 (Zampieri et al, 2019b) and SemEval-2020 .…”
Section: Introductionmentioning
confidence: 99%