2020
DOI: 10.1007/s10579-020-09502-8
|View full text |Cite
|
Sign up to set email alerts
|

Resources and benchmark corpora for hate speech detection: a systematic review

Abstract: Hate Speech in social media is a complex phenomenon, whose detection has recently gained significant traction in the Natural Language Processing community, as attested by several recent review works. Annotated corpora and benchmarks are key resources, considering the vast number of supervised approaches that have been proposed. Lexica play an important role as well for the development of hate speech detection systems. In this review, we systematically analyze the resources made available by the community at la… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
210
0
14

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 272 publications
(265 citation statements)
references
References 66 publications
(61 reference statements)
2
210
0
14
Order By: Relevance
“…In the last few years several works contributed to the development of HS detection automatic methods, both releasing novel annotated resources, lexicons of hate words or presenting automated classifiers. Two surveys (Schmidt and Wiegand 2017;Fortuna and Nunes 2018) and a systematic review were recently published on this topic (Poletto et al 2020). For what concerns Italian, a few resources have been recently developed using data from Twitter (Sanguinetti et al 2018;Poletto et al 2017;Patti 2019), Facebook (Del Vigna et al 2017) and Instagam (Corazza et al 2019).…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…In the last few years several works contributed to the development of HS detection automatic methods, both releasing novel annotated resources, lexicons of hate words or presenting automated classifiers. Two surveys (Schmidt and Wiegand 2017;Fortuna and Nunes 2018) and a systematic review were recently published on this topic (Poletto et al 2020). For what concerns Italian, a few resources have been recently developed using data from Twitter (Sanguinetti et al 2018;Poletto et al 2017;Patti 2019), Facebook (Del Vigna et al 2017) and Instagam (Corazza et al 2019).…”
Section: Related Workmentioning
confidence: 99%
“…For a more complete overview of the available HS resources, including lexica and benchmark datasets, in Italian and in other languages, we refer to Poletto et al (2020).…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Other systematic literature review such as [6] use soft computing techniques and [7] use benchmark corpora. In this research, we focus on using the word "hate speech" which according to [8] is a term that hits a particular community or individual that makes them suffer, while the opposition doesn't care.…”
Section: International Journal On Informatics Visualization Vol 4 (20mentioning
confidence: 99%