Negative Associations in Word Embeddings Predict Anti-Black Bias across Regions--but Only via Name Frequency

Austin Van Loon, Salvatore Giorgi, Johannes Eichstaedt, Robb Willer

doi:10.1609/icwsm.v16i1.19399

Abstract

The word embedding association test (WEAT) is an important method for measuring linguistic biases against social groups such as ethnic minorities in large text corpora. It does so by comparing the semantic relatedness of words prototypical of the groups (e.g., names unique to those groups) and attribute words (e.g., 'pleasant' and 'unpleasant' words). We show that anti-Black WEAT estimates from geo-tagged social media data at the level of metropolitan statistical areas strongly correlate with several measures of racial animus, even when controlling for sociodemographic covariates. However, we also show that every one of these correlations is explained by a third variable: the frequency of Black names in the underlying corpora relative to White names. This occurs because word embeddings tend to group positive words with frequent words, and negative words with rare words, in the estimated semantic space. As the frequency of Black names on social media is strongly correlated with Black Americans' prevalence in the population, this results in spuriously high anti-Black WEAT estimates wherever few Black Americans live. This suggests that research using the WEAT to measure bias should consider term frequency, and also demonstrates the potential consequences of using black-box models like word embeddings to study human cognition and behavior.
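To make the comparison described in the abstract concrete, the following is a minimal sketch of a WEAT effect size in its commonly used two-target, two-attribute form. It is not taken from the paper and may differ from the variant the authors used. The function names, the 4-dimensional toy vectors, and the use of the sample standard deviation (`ddof=1`) are all assumptions made for illustration; a real analysis would look up embedding vectors for name and attribute words in a trained model.

```python
import numpy as np


def cosine(u, v):
    """Cosine similarity between two 1-D vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))


def association(w, A, B):
    """Mean similarity of w to attribute set A minus its mean similarity to B."""
    return np.mean([cosine(w, a) for a in A]) - np.mean([cosine(w, b) for b in B])


def weat_effect_size(X, Y, A, B):
    """Standardized difference in association between target sets X and Y.

    X, Y: lists of target-word vectors (e.g., names typical of two groups).
    A, B: lists of attribute-word vectors (e.g., pleasant vs. unpleasant words).
    Assumption: the denominator is the sample standard deviation (ddof=1)
    of the associations over all target words.
    """
    s_X = np.array([association(x, A, B) for x in X])
    s_Y = np.array([association(y, A, B) for y in Y])
    pooled = np.concatenate([s_X, s_Y])
    return float((s_X.mean() - s_Y.mean()) / pooled.std(ddof=1))


# Toy data (hypothetical): X and A cluster around one direction, Y and B around another.
rng = np.random.default_rng(0)
dir_a = np.array([1.0, 0.0, 0.0, 0.0])
dir_b = np.array([0.0, 1.0, 0.0, 0.0])


def noisy(direction, n):
    """Return n vectors scattered around the given direction."""
    return [direction + 0.1 * rng.normal(size=direction.shape) for _ in range(n)]


A, B = noisy(dir_a, 5), noisy(dir_b, 5)   # stand-ins for pleasant / unpleasant words
X, Y = noisy(dir_a, 5), noisy(dir_b, 5)   # stand-ins for two sets of names

d = weat_effect_size(X, Y, A, B)
print(f"WEAT effect size: {d:.3f}")
```

A positive value means the X targets sit closer to the A attributes than the Y targets do. The abstract's caution applies directly to any such estimate: if one target set is much rarer in the corpus than the other, the effect size can reflect that frequency gap rather than bias, so relative term frequency is worth recording alongside each estimate.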

Journal: Proceedings of the International AAAI Conference on Web and Social Media
Publication Date: May 31, 2022
Citations: 6
