Bias Silhouette Analysis: Towards Assessing the Quality of Bias Metrics for Word Embedding Models

Spliethöver, Maximilian; Wachsmuth, Henning

doi:10.24963/ijcai.2021/77

Cited by 4 publications

(3 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The recent debiasing models (Bolukbasi et al, 2016;Wang et al, 2020) have only focused on removing gender bias in pre-trained word embeddings, particularly GloVe (Pennington et al, 2014), which has surfaced several social biases (Spliethöver and Wachsmuth, 2021). In this paper, we propose to mitigate five types of biases in GloVe embeddings, i.e., gender, race, religion, age, and LGBTQ+.…”

Section: Mitigating Multiple Biases In Glovementioning

confidence: 99%

A Multibias-mitigated and Sentiment Knowledge Enriched Transformer for Debiasing in Multimodal Conversational Emotion Recognition

Wang¹,

Ma²,

Zhang³

et al. 2022

Preprint

View full text Add to dashboard Cite

Multimodal emotion recognition in conversations (mERC) is an active research topic in natural language processing (NLP), which aims to predict human's emotional states in communications of multiple modalities, e,g., natural language and facial gestures. Innumerable implicit prejudices and preconceptions fill human language and conversations, leading to the question of whether the current datadriven mERC approaches produce a biased error. For example, such approaches may offer higher emotional scores on the utterances by females than males. In addition, the existing debias models mainly focus on gender or race, where multibias mitigation is still an unexplored task in mERC. In this work, we take the first step to solve these issues by proposing a series of approaches to mitigate five typical kinds of bias in textual utterances (i.e., gender, age, race, religion and LGBTQ+) and visual representations (i.e, gender and age), followed by a Multibias-Mitigated and sentiment Knowledge Enriched bi-modal Transformer (MMKET). Comprehensive experimental results show the effectiveness of the proposed model and prove that the debias operation has a great impact on the classification performance for mERC. We hope our study will benefit the development of bias mitigation in mERC and related emotion studies. * Jinlin Wang and Fang Ma contribute equally to this work and share the co-first authorship.

show abstract

Section: Mitigating Multiple Biases In Glovementioning

confidence: 99%

A Multibias-mitigated and Sentiment Knowledge Enriched Transformer for Debiasing in Multimodal Conversational Emotion Recognition

Wang¹,

Ma²,

Zhang³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…Using WEAT makes our results comparable with related work. We calculate WEAT scores using the implementation of the WEFE framework (Badilla et al, 2020) and use word lists of Spliethöver and Wachsmuth (2021).…”

Section: Evaluating Social Bias In Embeddingsmentioning

confidence: 99%

“…This impacts generalization performance negatively (Shah et al, 2020) and may have harmful consequences in practical applications (Bender et al, 2021;Joseph and Morgan, 2020). So far, one hurdle to mitigate these problems is the limited reliability of common measures of social bias present in a corpus (Spliethöver and Wachsmuth, 2021), stemming from embedding training algorithms not tailored to low-resource situations (Knoche et al, 2019;Spinde et al, 2021).…”

Section: Introductionmentioning

confidence: 99%

No Word Embedding Model Is Perfect: Evaluating the Representation Accuracy for Social Bias in the Media

Spliethöver¹,

Maximilian²,

Wachsmuth³

2022

Preprint

View full text Add to dashboard Cite

News articles both shape and reflect public opinion across the political spectrum. Analyzing them for social bias can thus provide valuable insights, such as prevailing stereotypes in society and the media, which are often adopted by NLP models trained on respective data. Recent work has relied on word embedding bias measures, such as WEAT. However, several representation issues of embeddings can harm the measures' accuracy, including lowresource settings and token frequency differences. In this work, we study what kind of embedding algorithm serves best to accurately measure types of social bias known to exist in US online news articles. To cover the whole spectrum of political bias in the US, we collect 500k articles and review psychology literature with respect to expected social bias. We then quantify social bias using WEAT along with embedding algorithms that account for the aforementioned issues. We compare how models trained with the algorithms on news articles represent the expected social bias. Our results suggest that the standard way to quantify bias does not align well with knowledge from psychology. While the proposed algorithms reduce the gap, they still do not fully match the literature.

show abstract

A Multibias-Mitigated and Sentiment Knowledge Enriched Transformer for Debiasing in Multimodal Conversational Emotion Recognition

Wang

Zhang

et al. 2022

Natural Language Processing and Chinese Computing

View full text Add to dashboard Cite

Bias Silhouette Analysis: Towards Assessing the Quality of Bias Metrics for Word Embedding Models

Cited by 4 publications

References 17 publications

A Multibias-mitigated and Sentiment Knowledge Enriched Transformer for Debiasing in Multimodal Conversational Emotion Recognition

A Multibias-mitigated and Sentiment Knowledge Enriched Transformer for Debiasing in Multimodal Conversational Emotion Recognition

No Word Embedding Model Is Perfect: Evaluating the Representation Accuracy for Social Bias in the Media

A Multibias-Mitigated and Sentiment Knowledge Enriched Transformer for Debiasing in Multimodal Conversational Emotion Recognition

Contact Info

Product

Resources

About