Fairness-Aware Structured Pruning in Transformers

Abdelrahman Zayed,Gonçalo Mordido,Sarath Chandar,Samira Shabanian,Ioana Baldini

doi:10.1609/aaai.v38i20.30256

Abstract

The increasing size of large language models (LLMs) has introduced challenges in their training and inference. Removing model components is perceived as a solution to tackle the large model sizes, however, existing pruning methods solely focus on performance, without considering an essential aspect for the responsible use of LLMs: model fairness. It is crucial to address the fairness of LLMs towards diverse groups, such as women, Black people, LGBTQ+, Jewish communities, among others, as they are being deployed and available to a wide audience. In this work, first, we investigate how attention heads impact fairness and performance in pre-trained transformer-based language models. We then propose a novel method to prune the attention heads that negatively impact fairness while retaining the heads critical for performance, i.e. language modeling capabilities. Our approach is practical in terms of time and resources, as it does not require fine-tuning the final pruned, and fairer, model. Our findings demonstrate a reduction in gender bias by 19%, 19.5%, 39.5%, 34.7%, 23%, and 8% for DistilGPT-2, GPT-2, GPT-Neo of two different sizes, GPT-J, and Llama 2 models, respectively, in comparison to the biased model, with only a slight decrease in performance. WARNING: This work uses language that is offensive in nature.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fairness-Aware Structured Pruning in Transformers

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Mar 24, 2024
Citations: 3

Similar Papers

Gender Biases and Where to Find Them: Exploring Gender Bias in Pre-Trained Transformer-based Language Models Using Movement Pruning
Przemyslaw Joniak ... Akiko Aizawa
-
Przemyslaw Joniak, et. al.Przemyslaw Joniak ... Akiko Aizawa
01 Jan 2021
01 Jan 2021

A Study of Vietnamese Sentiment Classification with Ensemble Pre-Trained Language Models
Dang Van Thin ... Duong Ngoc Hao
Vietnam Journal of Computer Science | VOL. 11
Dang Van Thin, et. al.Dang Van Thin ... Duong Ngoc Hao
07 Dec 2023
Vietnam Journal of Computer Science | VOL. 11

Identification of Semantically Similar Sentences in Clinical Notes: Iterative Intermediate Training Using Multi-Task Learning.
Diwakar Mahajan ... Ananya Poddar
JMIR Medical Informatics | VOL. 8
Diwakar Mahajan, et. al.Diwakar Mahajan ... Ananya Poddar
27 Nov 2020
JMIR Medical Informatics | VOL. 8

Arabic abstractive text summarization using RNN-based and transformer-based architectures
Mohammad Bani-Almarjeh ... Mohamad-Bassam Kurdy
Information Processing & Management | VOL. 60
Mohammad Bani-Almarjeh, et. al.Mohammad Bani-Almarjeh ... Mohamad-Bassam Kurdy
26 Dec 2022
Information Processing & Management | VOL. 60

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fairness-Aware Structured Pruning in Transformers

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence