Gender Biases and Where to Find Them: Exploring Gender Bias in Pre-Trained Transformer-based Language Models Using Movement Pruning

Przemyslaw Joniak,Akiko Aizawa

doi:10.18653/v1/2022.gebnlp-1.6

Abstract

Language model debiasing has emerged as an important field of study in the NLP community. Numerous debiasing techniques were proposed, but bias ablation remains an unaddressed issue. We demonstrate a novel framework for inspecting bias in pre-trained transformer-based language models via movement pruning. Given a model and a debiasing objective, our framework finds a subset of the model containing less bias than the original model. We implement our framework by pruning the model while fine-tuning it on the debiasing objective. Optimized are only the pruning scores - parameters coupled with the model's weights that act as gates. We experiment with pruning attention heads, an important building block of transformers: we prune square blocks, as well as establish a new way of pruning the entire heads. Lastly, we demonstrate the usage of our framework using gender bias, and based on our findings, we propose an improvement to an existing debiasing method. Additionally, we re-discover a bias-performance trade-off: the better the model performs, the more bias it contains.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Gender Biases and Where to Find Them: Exploring Gender Bias in Pre-Trained Transformer-based Language Models Using Movement Pruning

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2022
Citations: 4	License type: cc-by

Similar Papers

A Study of Vietnamese Sentiment Classification with Ensemble Pre-Trained Language Models
Dang Van Thin ... Duong Ngoc Hao
Vietnam Journal of Computer Science | VOL. 11
Dang Van Thin, et. al.Dang Van Thin ... Duong Ngoc Hao
07 Dec 2023
Vietnam Journal of Computer Science | VOL. 11

Identification of Semantically Similar Sentences in Clinical Notes: Iterative Intermediate Training Using Multi-Task Learning.
Diwakar Mahajan ... Ananya Poddar
JMIR Medical Informatics | VOL. 8
Diwakar Mahajan, et. al.Diwakar Mahajan ... Ananya Poddar
27 Nov 2020
JMIR Medical Informatics | VOL. 8

Arabic abstractive text summarization using RNN-based and transformer-based architectures
Mohammad Bani-Almarjeh ... Mohamad-Bassam Kurdy
Information Processing & Management | VOL. 60
Mohammad Bani-Almarjeh, et. al.Mohammad Bani-Almarjeh ... Mohamad-Bassam Kurdy
26 Dec 2022
Information Processing & Management | VOL. 60

Deep entity matching with pre-trained language models
Yuliang Li ... Jinfeng Li
Proceedings of the VLDB Endowment | VOL. 14
Yuliang Li, et. al.Yuliang Li ... Jinfeng Li
01 Sep 2020
Proceedings of the VLDB Endowment | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Gender Biases and Where to Find Them: Exploring Gender Bias in Pre-Trained Transformer-based Language Models Using Movement Pruning

Abstract

Talk to us

Similar Papers