2021
DOI: 10.48550/arxiv.2109.05322
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Latent Hatred: A Benchmark for Understanding Implicit Hate Speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0
1

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(12 citation statements)
references
References 46 publications
0
11
0
1
Order By: Relevance
“…Hate speech takes different forms in different social platforms (Wiegand, Ruppenhofer, and Kleinbauer 2019) and across time (Florio et al 2020). It is often implicit (ElSherief et al 2021), targeting a variety of groups. Consequently, transfer learning remains a challenge for hate-speech detection, and annotated Parler data is needed in order to achieve accurate classification.…”
Section: Parler Data Annotationmentioning
confidence: 99%
“…Hate speech takes different forms in different social platforms (Wiegand, Ruppenhofer, and Kleinbauer 2019) and across time (Florio et al 2020). It is often implicit (ElSherief et al 2021), targeting a variety of groups. Consequently, transfer learning remains a challenge for hate-speech detection, and annotated Parler data is needed in order to achieve accurate classification.…”
Section: Parler Data Annotationmentioning
confidence: 99%
“…Most research focused on overt forms of hate speech, but explicit hate is more easily identifiable, e.g., by lexicon-based methods (Davidson et al, 2017). Recent research focused on implicitness of hate speech and proposed new datasets (Jurgens et al, 2019;ElSherief et al, 2021;Wiegand et al, 2021;Ocampo et al, 2023;Nejadgholi et al, 2022;Hartvigsen et al, 2022). A study developed a taxonomy of implicit hate and provided a labeled dataset (ElSherief et al, 2021), but its annotation scheme is not generalizable in the Korean context (e.g., White Grievance).…”
Section: Related Workmentioning
confidence: 99%
“…The value of 1 means there is room for disagreement about whether it is an offensive expression speech or not. This category includes implicit hate expressions, such as sarcasm, stereotypes of a target group, etc (ElSherief et al, 2021). The labeling decision could depend on the context of expression.…”
Section: A2 Detailed Guidelinementioning
confidence: 99%
See 1 more Smart Citation
“…Recent works on hate speech understanding (Sap et al, 2019b;ElSherief et al, 2021;Huang et al, 2022) have considered training autoregressive language models to generate underlying explanations on hate speech. The models are trained on humanwritten free-text rationales such as implied statements and targeted groups.…”
Section: Introductionmentioning
confidence: 99%