2022
DOI: 10.48550/arxiv.2209.08681
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Domain Classification-based Source-specific Term Penalization for Domain Adaptation in Hate-speech Detection

Abstract: Warning: this paper contains content that may be offensive and distressing.State-of-the-art approaches for hate-speech detection usually exhibit poor performance in out-of-domain settings. This occurs, typically, due to classifiers overemphasizing sourcespecific information that negatively impacts its domain invariance. Prior work has attempted to penalize terms related to hatespeech from manually curated lists using feature attribution methods, which quantify the importance assigned to input terms by the clas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 32 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?