2022
DOI: 10.48550/arxiv.2201.08451
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Regional Negative Bias in Word Embeddings Predicts Racial Animus--but only via Name Frequency

Abstract: The word embedding association test (WEAT) is an important method for measuring linguistic biases against social groups such as ethnic minorities in large text corpora. It does so by comparing the semantic relatedness of words prototypical of the groups (e.g., names unique to those groups) and attribute words (e.g., 'pleasant' and 'unpleasant' words). We show that anti-black WEAT estimates from geo-tagged social media data at the level of metropolitan statistical areas strongly correlate with several measures … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 21 publications
0
1
0
Order By: Relevance
“…Wolfe and Caliskan (2021), for instance, show that linguistic bias against ethnic minorities' names is strongly correlated with how often the names appear in the underlying corpus. Van Loon et al (2022) show that this can cause omitted variable bias in social scientific analyses. They find significant and robust associations between anti-Black linguistic bias on social media and several non-linguistic measures of anti-Black animus, but also find each association to be spurious-with each being controlled away by the relative prevalences of Black and White names on social media.…”
Section: Discussionmentioning
confidence: 98%
“…Wolfe and Caliskan (2021), for instance, show that linguistic bias against ethnic minorities' names is strongly correlated with how often the names appear in the underlying corpus. Van Loon et al (2022) show that this can cause omitted variable bias in social scientific analyses. They find significant and robust associations between anti-Black linguistic bias on social media and several non-linguistic measures of anti-Black animus, but also find each association to be spurious-with each being controlled away by the relative prevalences of Black and White names on social media.…”
Section: Discussionmentioning
confidence: 98%