Proceedings of the 28th International Conference on Computational Linguistics 2020
DOI: 10.18653/v1/2020.coling-main.511
|View full text |Cite
|
Sign up to set email alerts
|

Semi-supervised Multi-task Learning for Multi-label Fine-grained Sexism Classification

Abstract: Sexism, a form of oppression based on one's sex, manifests itself in numerous ways and causes enormous suffering. In view of the growing number of experiences of sexism reported online, categorizing these recollections automatically can assist the fight against sexism, as it can facilitate effective analyses by gender studies researchers and government officials involved in policy making. In this paper, we investigate the fine-grained, multi-label classification of accounts (reports) of sexism. To the best of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 31 publications
0
5
0
Order By: Relevance
“…Online sexism is a pervasive problem that can harm women and create hostile environments. Sexism detection in social media content has become an emerging field of natural language processing and social computing (Jha and Mamidi, 2017;Karlekar and Bansal, 2018;Zhang and Luo, 2019;Parikh et al, 2019;Abburi et al, 2020;Chiril et al, 2020;Rodríguez-Sánchez et al, 2021;Sen et al, 2022). In the Automatic Misogyny Identification (AMI) shared task at IberEval and EvalIta 2018, participants were tasked with identifying instances of sexist behavior in tweets and categorizing them based on a taxonomy proposed by Anzovino et al (2018).…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Online sexism is a pervasive problem that can harm women and create hostile environments. Sexism detection in social media content has become an emerging field of natural language processing and social computing (Jha and Mamidi, 2017;Karlekar and Bansal, 2018;Zhang and Luo, 2019;Parikh et al, 2019;Abburi et al, 2020;Chiril et al, 2020;Rodríguez-Sánchez et al, 2021;Sen et al, 2022). In the Automatic Misogyny Identification (AMI) shared task at IberEval and EvalIta 2018, participants were tasked with identifying instances of sexist behavior in tweets and categorizing them based on a taxonomy proposed by Anzovino et al (2018).…”
Section: Related Workmentioning
confidence: 99%
“…In the same context, Chiril et al (2020) have introduced a French sexism detection method based on BERT contextualized word embeddings complemented with both linguistic features and generalization strategies. Abburi et al (2020) have proposed a semi-supervised multi-task learning method using BERT model for multi-label fine-grained sexism classification. Another line of work uses counterfactually augmented data to improve out-of-domain generalizability for sexism and hate speech detection (Sen et al, 2022).…”
Section: Related Workmentioning
confidence: 99%
“…Apart from detection of directed hateful content from tweets, there has been work on identifying categories of sexism from personal accounts such as (Karlekar and Bansal, 2018). ) created a dataset having 23 labeled categories of sexism from sexism accounts without maintaining mutual exclusivity in the categories and proposed a multi-task approach involving three auxiliary tasks for the multilabel classification in (Abburi et al, 2020).…”
Section: Related Workmentioning
confidence: 99%
“…The classifier combines sentence embeddings generated using a BERT [13] model with those generated from ELMo [31] and GloVe [30] embeddings using biLSTM with attention and CNN. Abburi et al [1] explore a multitask approach for semi-supervised sexism classification that deploys three auxiliary tasks such as estimating the topic proportion distribution, predicting the cluster label and detecting an account of sexism without inflicting any manual labeling cost. They also explore the objective functions that make use of label correlations present in the training data.…”
Section: Related Workmentioning
confidence: 99%
“…To the best of our knowledge, Parikh et al [28] is the only work that explores the multi-label categorization of accounts of sexism using machine learning and considers more than three categories of sexism. It provides the largest dataset containing accounts drawn from 'Everyday Sexism Project' 1 , where experiences of sexism are shared from all over the world. The textual accounts are annotated using 23 categories of sexism formulated with the help of a social scientist.…”
Section: Introductionmentioning
confidence: 99%