2020
DOI: 10.48550/arxiv.2005.00962
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Gender Gap in Natural Language Processing Research: Disparities in Authorship and Citations

Abstract: Disparities in authorship and citations across genders can have substantial adverse consequences not just on the disadvantaged gender, but also on the field of study as a whole. In this work, we examine female first author percentages and the citations to their papers in Natural Language Processing. We find that only about 29% of first authors are female and only about 25% of last authors are female. Notably, this percentage has not improved since the mid 2000s. We also show that, on average, female first auth… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 17 publications
0
3
0
Order By: Relevance
“…As we have discussed, this is not the case. Some studies even find that the increase in the share of women in authorship has led to an increase of gender differences in both productivity and impact (Huang et al, 2020;Mohammad et al, 2020).…”
Section: Article-level Explanationsmentioning
confidence: 99%
See 1 more Smart Citation
“…As we have discussed, this is not the case. Some studies even find that the increase in the share of women in authorship has led to an increase of gender differences in both productivity and impact (Huang et al, 2020;Mohammad et al, 2020).…”
Section: Article-level Explanationsmentioning
confidence: 99%
“…At the article level, however, there is no clear gap. While some studies of some fields find that women are cited less per article (e.g, Larivière et al, 2013;Mohammad et al, 2020), or are under-cited per reference lists (Håkanson, 2005;Lutz, 1990), more studies find that articles written by women receive comparable, sometimes even higher rates, than articles written by men (e.g., Healy, 2015;Huang et al, 2020;Leahey et al, 2017;Long, 1992).…”
Section: Introductionmentioning
confidence: 99%
“…Because AND mistakes can cause representational and allocational harm, and no system will ever be perfect, it is critical that any live AND service easily allows authors to correct mistakes made by the system. Other than using self-reported demographic attributes [25], research has focused on using inferred gender from names for studying gender disparities in authorship and citation trends [26,27,28]. Similarly, Bertrand and Mullainathan [29] used inferred gender and race for studying disparities in hiring.…”
Section: Previous Workmentioning
confidence: 99%