Bipol: A novel multi-axes bias evaluation metric with explainability for NLP

Lama Alkhaled,Tosin Adewumi,Sana Sabah Sabry

doi:10.1016/j.nlp.2023.100030

Lama Alkhaled, Tosin Adewumi + Show 1 more

Open Access

https://doi.org/10.1016/j.nlp.2023.100030

Copy DOI

Abstract

We introduce bipol, a new metric with explainability, for estimating social bias in text data. Harmful bias is prevalent in many online sources of data that are used for training machine learning (ML) models. In a step to address this challenge we create a novel metric that involves a two-step process: corpus-level evaluation based on model classification and sentence-level evaluation based on (sensitive) term frequency (TF). After creating new models to classify bias using SotA architectures, we evaluate two popular NLP datasets (COPA and SQuADv2) and the WinoBias dataset. As additional contribution, we created a large English dataset (with almost 2 million labeled samples) for training models in bias classification and make it publicly available. We also make public our codes.

Full Text