Are We Consistently Biased? Multidimensional Analysis of Biases in Distributional Word Vectors

Anne Lauscher,Goran Glavaš

doi:10.18653/v1/s19-1010

Abstract

Word embeddings have recently been shown to reflect many of the pronounced societal biases (e.g., gender bias or racial bias). Existing studies are, however, limited in scope and do not investigate the consistency of biases across relevant dimensions like embedding models, types of texts, and different languages. In this work, we present a systematic study of biases encoded in distributional word vector spaces: we analyze how consistent the bias effects are across languages, corpora, and embedding models. Furthermore, we analyze the cross-lingual biases encoded in bilingual embedding spaces, indicative of the effects of bias transfer encompassed in cross-lingual transfer of NLP models. Our study yields some unexpected findings, e.g., that biases can be emphasized or downplayed by different embedding models or that user-generated content may be less biased than encyclopedic text. We hope our work catalyzes bias research in NLP and informs the development of bias reduction techniques.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Are We Consistently Biased? Multidimensional Analysis of Biases in Distributional Word Vectors

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2019
Citations: 48	License type: cc-by-nc-sa

Similar Papers

A General Framework for Implicit and Explicit Debiasing of Distributional Word Vector Spaces
Anne Lauscher ... Simone Paolo Ponzetto
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
Anne Lauscher, et. al.Anne Lauscher ... Simone Paolo Ponzetto
03 Apr 2020
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34

An interactive method for measuring gender bias and evaluating bias in Chinese word embeddings
Chunlin Qin ... Yan Liu
-
Chunlin Qin, et. al.Chunlin Qin ... Yan Liu
15 Apr 2023
15 Apr 2023

Exploring What Is Encoded in Distributional Word Vectors: A Neurobiologically Motivated Analysis
Akira Utsumi
Cognitive Science | VOL. 44
Akira UtsumiAkira Utsumi
26 May 2020
Cognitive Science | VOL. 44

Gender and Racial Stereotype Detection in Legal Opinion Word Embeddings
Sean Matthews ... John Hudzina
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36
Sean Matthews, et. al.Sean Matthews ... John Hudzina
28 Jun 2022
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Are We Consistently Biased? Multidimensional Analysis of Biases in Distributional Word Vectors

Abstract

Talk to us

Similar Papers