Abstract

Recently there has been growing concern in academia, industrial research laboratories and the mainstream commercial media about the phenomenon dubbed machine bias, where trained statistical models, unbeknownst to their creators, come to reflect controversial societal asymmetries such as gender or racial bias. A significant number of Artificial Intelligence tools have recently been reported to be harmfully biased against minority groups, with accounts of racially biased criminal-recidivism predictors, Apple's iPhone X failing to differentiate between two distinct Asian users, and the now infamous case of Google Photos mistakenly classifying Black people as gorillas. Although a systematic study of such biases can be difficult, we believe that automated translation tools can be exploited through gender-neutral languages to provide a window into the phenomenon of gender bias in AI. In this paper, we start with a comprehensive list of job positions from the U.S. Bureau of Labor Statistics (BLS) and use it to build sentences of the form "He/She is an Engineer" (where "Engineer" is replaced by the job position of interest) in 12 gender-neutral languages, including Hungarian, Chinese, and Yoruba. We translate these sentences into English using the Google Translate API and collect statistics on the frequency of female, male and gender-neutral pronouns in the translated output. We then show that Google Translate exhibits a strong tendency toward male defaults, in particular for fields typically associated with an unbalanced gender distribution or gender stereotypes, such as STEM (Science, Technology, Engineering and Mathematics) jobs. We compare these statistics with BLS data on the frequency of female participation in each job position and show that Google Translate fails to reproduce the real-world distribution of female workers. In summary, we provide experimental evidence that, even if one does not in principle expect a 50:50 pronominal gender distribution, Google Translate yields male defaults far more frequently than demographic data alone would suggest. We believe that our study can shed further light on the phenomenon of machine bias, and we hope that it will ignite a debate about the need to augment current statistical translation tools with debiasing techniques, which can already be found in the scientific literature.
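
As a rough illustration of the pipeline described above, the sketch below translates templated sentences from a gender-neutral language into English and tallies which pronoun the translation engine chooses. It assumes the google-cloud-translate v2 Python client; the Hungarian template, the short job list, and the pronoun mapping are illustrative stand-ins for the paper's full BLS-derived occupation list and 12-language setup, not the authors' actual code.

```python
# Minimal sketch: translate gender-neutral sentences into English and
# count male / female / neutral pronoun defaults in the output.
# Assumes the google-cloud-translate v2 client and valid credentials
# (GOOGLE_APPLICATION_CREDENTIALS). Templates and jobs are illustrative.
import re
from collections import Counter

from google.cloud import translate_v2 as translate

client = translate.Client()

# Hungarian uses the gender-neutral pronoun "ő"; these job nouns are
# hypothetical stand-ins for the BLS-derived occupation list.
JOBS_HU = ["mérnök", "ápoló", "tanár"]  # engineer, nurse, teacher
TEMPLATE_HU = "ő egy {}."

PRONOUN_CLASS = {"he": "male", "she": "female", "they": "neutral", "it": "neutral"}

counts = Counter()
for job in JOBS_HU:
    sentence = TEMPLATE_HU.format(job)
    result = client.translate(sentence, source_language="hu", target_language="en")
    english = result["translatedText"].lower()
    # Classify the first personal pronoun found in the translated sentence.
    match = re.search(r"\b(he|she|they|it)\b", english)
    if match:
        counts[PRONOUN_CLASS[match.group(1)]] += 1

print(counts)  # distribution of male / female / neutral defaults per language
```

In the study itself, the resulting pronoun frequencies per occupation are then compared against BLS data on female participation in each job position.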
