2019 International Conference on Cyber Situational Awareness, Data Analytics and Assessment (Cyber SA) 2019
DOI: 10.1109/cybersa.2019.8899669
|View full text |Cite
|
Sign up to set email alerts
|

A Comparison of Machine Learning Approaches for Detecting Misogynistic Speech in Urban Dictionary

Abstract: Recent moves to consider misogyny as a hate crime have refocused efforts for owners of web properties to detect and remove misogynistic speech. This paper considers the use of deep learning techniques for detection of misogyny in Urban Dictionary, a crowdsourced online dictionary for slang words and phrases. We compare the performance of two deep learning techniques, Bi-LSTM and Bi-GRU, to detect misogynistic speech with the performance of more conventional machine learning techniques, logistic regression, Nai… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(8 citation statements)
references
References 42 publications
0
8
0
Order By: Relevance
“…Several recent studies evidence the growing interest of the scientific community on automatic detection of hate speech, mainly for English [7][8][9][10][11][12]. This research area has grown mainly thanks to the competitions organized at SemEval [13] (e.g., HatEval, OffensEval, and Toxic Spans Detection) and other venues, such as TRAC [14] and HASOC [15].…”
Section: Related Workmentioning
confidence: 99%
“…Several recent studies evidence the growing interest of the scientific community on automatic detection of hate speech, mainly for English [7][8][9][10][11][12]. This research area has grown mainly thanks to the competitions organized at SemEval [13] (e.g., HatEval, OffensEval, and Toxic Spans Detection) and other venues, such as TRAC [14] and HASOC [15].…”
Section: Related Workmentioning
confidence: 99%
“…Studies Online detection [15], [16], [17], [18], [19], [20], [21], [22], [23], [24], [25], [26], [27], [28], [29], [30], [31], [32], [33], [34], [35], [36], [37], [38], [9], [39], [40], [41], [42], [43], [8], [44], [45], [46], [47], [48], [49] Offline detection [50], [51], [52], [53], [54], [7], [55], [56] Safety [57], [13], [58], [59], [60], [61],…”
Section: Categorymentioning
confidence: 99%
“…This is followed by image or video files (17.3%), signals (11.5%), voice recordings (1.9%), and user input (1.9%). Methods for data collection included the Twitter API [21], Facebook Graph API [33], Urban Dictionary API [30], existing datasets [24], manual retrieval of online data [28], or experiments [55]. • Features engineering: As seen in Fig.…”
Section: ) Answer To the Second Research Questionmentioning
confidence: 99%
“…One problem is the lack of high-quality datasets to train machine learning models, which would enable the creation of efficient and scalable automated detection systems . Previous research has primarily used Twitter data and there is a pressing need for other platforms to be researched Lynn et al (2019a). Notably, de-spite social scientific studies that show online misogyny is pervasive on some Reddit communities, to date a training dataset for misogyny has not been created with Reddit data.…”
Section: Introductionmentioning
confidence: 99%
“…Lynn et al (2019b) provide a dataset of 2k Urban Dictionary definitions of which half are labelled as misogynistic. In Lynn et al (2019a) they show that deep learning techniques had greater accuracy in detecting misogyny than conventional machine learning techniques.…”
Section: Introductionmentioning
confidence: 99%