Proceedings of the 2018 Conference of the North American Chapter Of the Association for Computational Linguistics: Hu 2018
DOI: 10.18653/v1/n18-2003
|View full text |Cite
|
Sign up to set email alerts
|

Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods

Abstract: We introduce a new benchmark, WinoBias, for coreference resolution focused on gender bias. Our corpus contains Winograd-schema style sentences with entities corresponding to people referred by their occupation (e.g. the nurse, the doctor, the carpenter). We demonstrate that a rule-based, a feature-rich, and a neural coreference system all link gendered pronouns to pro-stereotypical entities with higher accuracy than anti-stereotypical entities, by an average difference of 21.1 in F1 score. Finally, we demonstr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

7
581
1
2

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 498 publications
(591 citation statements)
references
References 24 publications
7
581
1
2
Order By: Relevance
“…Model versus data. Recent NLP research has focused on gender biases in word embeddings (Bolukbasi et al, 2016;Zhao et al, 2017) and co-reference systems (Zhao et al, 2018;Rudinger et al, 2018).…”
Section: What Makes This Study Different?mentioning
confidence: 99%
“…Model versus data. Recent NLP research has focused on gender biases in word embeddings (Bolukbasi et al, 2016;Zhao et al, 2017) and co-reference systems (Zhao et al, 2018;Rudinger et al, 2018).…”
Section: What Makes This Study Different?mentioning
confidence: 99%
“…To create our corpus, we use the templates from the WinoBias corpus created by Zhao et al (2018). This corpus contains sentences with job titles that are gender neutral, yet contain a stereotypical bias towards one gender (doctors and CEOs are male, nurses and housekeepers female).…”
Section: Anaphora Resolution and Gender Biasmentioning
confidence: 99%
“…Stereotypical referents The intermediate conclusion that the language model performs successful anaphora resolution on our experiment also provides us the opportunity to probe the gender biases of the model. To do so, we repeat the pronoun preference test on an adapted version of the WinoBias corpus (Zhao et al, 2018), in which all referents are only stereotypically considered to be male or female (e.g., doctor and nurse).…”
Section: Unambiguous Referents Inmentioning
confidence: 99%
“…This means that if the gender of one word changes, the others have to be updated to match. As a result, simple heuristics, such as augmenting a corpus with additional sentences in which he and she have been swapped (Zhao et al, 2018), will yield ungrammatical sentences. Consider the Spanish phrase el ingeniero experto (the skilled engineer).…”
Section: Introductionmentioning
confidence: 99%