2022
DOI: 10.1371/journal.pone.0264270
|View full text |Cite
|
Sign up to set email alerts
|

Avoiding bias when inferring race using name-based approaches

Abstract: Racial disparity in academia is a widely acknowledged problem. The quantitative understanding of racial-based systemic inequalities is an important step towards a more equitable research system. However, because of the lack of robust information on authors’ race, few large-scale analyses have been performed on this topic. Algorithmic approaches offer one solution, using known information about authors, such as their names, to infer their perceived race. As with any other algorithm, the process of racial infere… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
19
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 30 publications
(19 citation statements)
references
References 28 publications
0
19
0
Order By: Relevance
“…The underrepresentation of Black people in most data sets means their race will be misrecognized more often than their white peers. 24 Moreover, within the US Black population, migration, social trends and movements, class, and other factors shape who goes by distinctively Black names, and thus who is ascribed Black identity by other people and algorithms. 16 Among the Black social scientists in our sample, those whose parent(s) have PhDs were correctly recognized as Black more often than those whose parent(s) did not attend college.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…The underrepresentation of Black people in most data sets means their race will be misrecognized more often than their white peers. 24 Moreover, within the US Black population, migration, social trends and movements, class, and other factors shape who goes by distinctively Black names, and thus who is ascribed Black identity by other people and algorithms. 16 Among the Black social scientists in our sample, those whose parent(s) have PhDs were correctly recognized as Black more often than those whose parent(s) did not attend college.…”
Section: Discussionmentioning
confidence: 99%
“…Even attempts to correct for these inequalities in error rates can be thrown off by them. For example, Kozlowski et al's 24 approach to compensating for the high rate at which Black people are racially mislabeled assumes they are all mislabeled at the same rate. If their corrected data was used in an analysis of parental education or class, however, the uneven rates of racial misclassification we show for parental education would still confound the analysis.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In other words, we do not assign authors to a unique racial category. In previous work (58), we have shown that, given the overlap of Black and White family names (59,60), the use of a threshold-filtering those names with a probability for a single group above a threshold and assigning all authors with that name to that single category-underestimates the proportion of Black authors. This distinction is critical: We do not aim to identify each author's self-perceived racial category but to build aggregates of racial group disparities.…”
Section: Methodsmentioning
confidence: 97%
“…When necessary, we relied on images. A limitation of this method and the study is that we used only a binary gender classification (men-women) and did not consider other genders or groups (Kozlowski et al, 2022).…”
Section: Methodsmentioning
confidence: 99%