Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) 2020
DOI: 10.18653/v1/2020.emnlp-main.606
|View full text |Cite
|
Sign up to set email alerts
|

HABERTOR: An Efficient and Effective Deep Hatespeech Detector

Abstract: We present our HABERTOR model for detecting hatespeech in large scale user-generated content. Inspired by the recent success of the BERT model, we propose several modifications to BERT to enhance the performance on the downstream hatespeech classification task. HABERTOR inherits BERT's architecture, but is different in four aspects: (i) it generates its own vocabularies and is pre-trained from the scratch using the largest scale hatespeech dataset; (ii) it consists of Quaternionbased factorized components, res… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 18 publications
(3 citation statements)
references
References 43 publications
0
3
0
Order By: Relevance
“…Hateful content detection has been extensively studied, with numerous papers presenting new datasets, models and architectures for hateful content detection, including several subtasks, such as multilingual, multimodal and multi-target hate detection [29,61,68]. Notably, incorporating sentence-level context has been shown to improve hateful content detection, with several papers deploying transformer-based models such as Bidirectional Encoder Representation from Transformers (BERT) to better distinguish between hateful and non-hateful content, even when they have lexical similarities [37,57]. Indeed, as Mullah and Zainon [46] write in their comprehensive review of ML methods for automated hate speech detection, deep learning techniques leveraging such language models can considerably improve how context-dependent hate speech is detected.…”
Section: Automated Methods Of Online Hate Detectionmentioning
confidence: 99%
“…Hateful content detection has been extensively studied, with numerous papers presenting new datasets, models and architectures for hateful content detection, including several subtasks, such as multilingual, multimodal and multi-target hate detection [29,61,68]. Notably, incorporating sentence-level context has been shown to improve hateful content detection, with several papers deploying transformer-based models such as Bidirectional Encoder Representation from Transformers (BERT) to better distinguish between hateful and non-hateful content, even when they have lexical similarities [37,57]. Indeed, as Mullah and Zainon [46] write in their comprehensive review of ML methods for automated hate speech detection, deep learning techniques leveraging such language models can considerably improve how context-dependent hate speech is detected.…”
Section: Automated Methods Of Online Hate Detectionmentioning
confidence: 99%
“…Other works have attempted to mitigate biases, such as gender bias (Dinan et al, 2020a) and racial bias (Sap et al, 2019). Recently, Tran et al (2020) modify BERT (Devlin et al, 2019) to detect hatespeech. introduce a method to distill safety standards into the generative dialogue agent.…”
Section: Related Workmentioning
confidence: 99%
“…Another challenge of practical importance is that state-of-the-art deep learning networks are extremely large, reaching up to hundreds of millions, or billions of parameters. Ideally, hate detection would be performed in real time at the end device, allowing the alert the user prior to posting hateful content, for example, but high-performing compact hate detection models are required to achieve this goal (Tran et al, 2020;Behzadi et al, 2021).…”
Section: Introductionmentioning
confidence: 99%