Proceedings of the Fourteenth Workshop on Semantic Evaluation 2020
DOI: 10.18653/v1/2020.semeval-1.208
|View full text |Cite
|
Sign up to set email alerts
|

NUIG at SemEval-2020 Task 12: Pseudo Labelling for Offensive Content Classification

Abstract: This work addresses the classification problem defined by sub-task A (English only) of the OffensEval 2020 challenge. We used a semi-supervised approach to classify given tweets into an offensive (OFF) or not-offensive (NOT) class. As the OffensEval 2020 dataset is loosely labelled with confidence scores given by unsupervised models, we used last year's offensive language identification dataset (OLID) to label the OffensEval 2020 dataset. Our approach uses a pseudo-labelling method to annotate the current data… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
43
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 19 publications
(43 citation statements)
references
References 18 publications
0
43
0
Order By: Relevance
“…In the coding scheme, we first presented examples and then asked the coders to annotate to what extent each video used cartoons using a 4-point Likert-type scale (0 = never ; 1 = a few ; 2 = many ; 3 = almost always ). We included both Internet memes and animated GIFs as memes (Suryawanshi et al, 2020; Tuters & Hagen, 2020), and the coders also saw examples of memes for reference before annotating the use of memes using the same 4-point Likert-type scale. For funny sounds, we presented the coders with a list of funny sounds 8 in the training phase.…”
Section: Methodsmentioning
confidence: 99%
“…In the coding scheme, we first presented examples and then asked the coders to annotate to what extent each video used cartoons using a 4-point Likert-type scale (0 = never ; 1 = a few ; 2 = many ; 3 = almost always ). We included both Internet memes and animated GIFs as memes (Suryawanshi et al, 2020; Tuters & Hagen, 2020), and the coders also saw examples of memes for reference before annotating the use of memes using the same 4-point Likert-type scale. For funny sounds, we presented the coders with a list of funny sounds 8 in the training phase.…”
Section: Methodsmentioning
confidence: 99%
“…The results show that the model's performance has increased by around 4% with the addition of the additional collection of features. Suryawanshi et al (2020) created a multimodal dataset on the 2016 U.S. presidential election that included 743 offensive and nonoffensive memes. They combined the multimodal features using the early fusion strategy.…”
Section: Related Workmentioning
confidence: 99%
“…Hateful meme classification is an emerging multimodal task made popular by the availability of several recent hateful memes datasets (Kiela et al, 2020;Suryawanshi et al, 2020;Gomez et al, 2020). For instance, Facebook had organized the Hateful Memes Challenge, which encouraged researchers to submit solutions to perform hateful memes classification (Kiela et al, 2020).…”
Section: Hateful Meme Detectionmentioning
confidence: 99%
“…Existing studies have explored classic twostream models that combine the text and visual features learned from text and image encoders using attention-based mechanisms and other fusion methods to perform hateful meme classification (Zhang et al, 2020;Kiela et al, 2020;Suryawanshi et al, 2020). Another popular line of approach is finetuning large scale pre-trained multimodal models for the task (Lippe et al, 2020;Zhu, 2020;Zhou and Chen, 2020;Muennighoff, 2020;Velioglu and Rose, 2020;Pramanick et al, 2021b;Hee et al, 2022).…”
Section: Hateful Meme Detectionmentioning
confidence: 99%
See 1 more Smart Citation