Fabian Floeck scite author profile

Research has focused on automated methods to effectively detect sexism online. Although overt sexism seems easy to spot, its subtle forms and manifold expressions are not. In this paper, we outline the different dimensions of sexism by grounding them in their implementation in psychological scales. From the scales, we derive a codebook for sexism in social media, which we use to annotate existing and novel datasets, surfacing their limitations in breadth and validity with respect to the construct of sexism. Next, we leverage the annotated datasets to generate adversarial examples, and test the reliability of sexism detection methods. Results indicate that current machine learning models pick up on a very narrow set of linguistic markers of sexism and do not generalize well to out-of-domain examples. Yet, including diverse data and adversarial examples at training time results in models that generalize better and that are more robust to artifacts of data collection. By providing a scale-based codebook and insights regarding the shortcomings of the state-of-the-art, we hope to contribute to the development of better and broader models for sexism detection, including reflections on theory-driven approaches to data collection.

show abstract

TED-On: A Total Error Framework for Digital Traces of Human Behavior on Online Platforms

Sen¹,

Floeck²,

Weller³

et al. 2019

Preprint

View full text Add to dashboard Cite

How Does Counterfactually Augmented Data Impact Models for Social Computing Constructs?

Sen¹,

Samory²,

Floeck³

et al. 2021

View full text Add to dashboard Cite

As NLP models are increasingly deployed in socially situated settings such as online abusive content detection, it is crucial to ensure that these models are robust. One way of improving model robustness is to generate counterfactually augmented data (CAD) for training models that can better learn to distinguish between core features and data artifacts. While models trained on this type of data have shown promising out-of-domain generalizability, it is still unclear what the sources of such improvements are. We investigate the benefits of CAD for social NLP models by focusing on three social computing constructs -sentiment, sexism, and hate speech. Assessing the performance of models trained with and without CAD across different types of datasets, we find that while models trained on CAD show lower in-domain performance, they generalize better out-of-domain. We unpack this apparent discrepancy using machine explanations and find that CAD reduces model reliance on spurious features. Leveraging a novel typology of CAD to analyze their relationship with model performance, we find that CAD which acts on the construct directly or a diverse set of CAD leads to higher performance.

show abstract

"Call me sexist, but...": Revisiting Sexism Detection Using Psychological Scales and Adversarial Samples

Samory¹,

Sen²,

Kohne³

et al. 2020

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Fabian Floeck

Imitation and Quality of Tags in Social Bookmarking Systems – Collective Intelligence Leading to Folksonomies

“Call me sexist, but...” : Revisiting Sexism Detection Using Psychological Scales and Adversarial Samples

TED-On: A Total Error Framework for Digital Traces of Human Behavior on Online Platforms

How Does Counterfactually Augmented Data Impact Models for Social Computing Constructs?

"Call me sexist, but...": Revisiting Sexism Detection Using Psychological Scales and Adversarial Samples

Contact Info

Product

Resources

About