2022
DOI: 10.1016/j.is.2020.101584
|View full text |Cite
|
Sign up to set email alerts
|

Hate speech detection is not as easy as you may think: A closer look at model validation (extended version)

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

5
171
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 98 publications
(176 citation statements)
references
References 11 publications
5
171
0
Order By: Relevance
“…Besides English (Basile et al, 2019;Waseem and Hovy, 2016;Davidson et al, 2017;Founta et al, 2018;Qian et al, 2018), we notice a growing interest in the study of hate speech in other languages, such as Portuguese (Fortuna et al, 2019), Italian (Sanguinetti et al, 2018), German (Ross et al, 2016), Indonesian (Ibrohim and Budi, 2019), French (Ousidhoum et al, 2019), Dutch (Hee et al, 2015, and Arabic (Albadi et al, 2018;Mulki et al, 2019;Ousidhoum et al, 2019). Challenging questions being tackled in this area involve the way abusive language spreads online (Mathew et al, 2019), fast changing topics during data collection (Liu et al, 2019), user bias in publicly available datasets (Arango et al, 2019), bias in hate speech classification and different methods to reduce it (Park et al, 2018;Davidson et al, 2019;Kennedy et al, 2020).…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Besides English (Basile et al, 2019;Waseem and Hovy, 2016;Davidson et al, 2017;Founta et al, 2018;Qian et al, 2018), we notice a growing interest in the study of hate speech in other languages, such as Portuguese (Fortuna et al, 2019), Italian (Sanguinetti et al, 2018), German (Ross et al, 2016), Indonesian (Ibrohim and Budi, 2019), French (Ousidhoum et al, 2019), Dutch (Hee et al, 2015, and Arabic (Albadi et al, 2018;Mulki et al, 2019;Ousidhoum et al, 2019). Challenging questions being tackled in this area involve the way abusive language spreads online (Mathew et al, 2019), fast changing topics during data collection (Liu et al, 2019), user bias in publicly available datasets (Arango et al, 2019), bias in hate speech classification and different methods to reduce it (Park et al, 2018;Davidson et al, 2019;Kennedy et al, 2020).…”
Section: Related Workmentioning
confidence: 99%
“…Moreover, the account-based dataset of (Mulki et al, 2019), referred to as AR3 in Figures 3 and 4 shows more robustness towards keywords. Nevertheless, such a collection strategy may generate a linguistic bias that goes with the same stylistic features used by the targeted accounts, similarly to Waseem and Hovy (2016)'s user bias reported by Arango et al (2019).…”
Section: Robustness Of Keyword-based Selectionmentioning
confidence: 99%
See 1 more Smart Citation
“…Waseem [46] wrote about the annotator's influence on hate speech detection, showing expert annotations contributed to a better detection rate. Arango et al [1] analyzed a model validation problem of hate speech detection.…”
Section: Related Work 21 Hate Speech Detectionmentioning
confidence: 99%
“…of online social networks, hate speech is spreading faster and affecting a larger population than before in human history across the world 1 . Therefore, quickly and accurately identifying hate speech becomes crucial for keeping a harmonic and healthy online social environment, mitigating the possible conflicts, and protecting the diversity of our society.…”
mentioning
confidence: 99%