Proceedings of the Second Workshop on Language Technology for Equality, Diversity and Inclusion 2022
DOI: 10.18653/v1/2022.ltedi-1.37
|View full text |Cite
|
Sign up to set email alerts
|

Nozza@LT-EDI-ACL2022: Ensemble Modeling for Homophobia and Transphobia Detection

Abstract: In this paper, we describe our approach for the task of homophobia and transphobia detection in English social media comments. The dataset consists of YouTube comments, and it has been released for the shared task on Homophobia/Transphobia Detection in social media comments. Given the high class imbalance, we propose a solution based on data augmentation and ensemble modeling. We fine-tuned different large language models (BERT, RoBERTa, and HateBERT) and used the weighted majority vote on their predictions. O… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 26 publications
(27 reference statements)
0
2
0
Order By: Relevance
“…In one study, the ensemble transformer-based model was implemented to classify homophobia and transphobia on social media comments in Tamil and Tamil-English datasets [1]. Similarly, to classify the comments as homophobia and transphobia in English, different transformer-based models (BERT, RoBERTa and HateBERT) were implemented [2].…”
Section: Related Workmentioning
confidence: 99%
“…In one study, the ensemble transformer-based model was implemented to classify homophobia and transphobia on social media comments in Tamil and Tamil-English datasets [1]. Similarly, to classify the comments as homophobia and transphobia in English, different transformer-based models (BERT, RoBERTa and HateBERT) were implemented [2].…”
Section: Related Workmentioning
confidence: 99%
“…[12] employed a RoBERTa-based approach to explore on the task [10] data. Given the substantial minority class, the authors of [13] authors proposed a method using data augmentation and ensembles modelling. They fine-tuned big linguistic models utilised the balanced democratic majority on their predictions.…”
Section: Literature Surveymentioning
confidence: 99%