Abstract

Algorithmic scoring methods have been widely used in the finance industry for several decades in order to prevent risk and to automate and optimize decisions. Regulatory requirements as given by the Basel Committee on Banking Supervision (BCBS) or the EU data protection regulations have led to increasing interest and research activity in understanding black box machine learning models by means of explainable machine learning. Even though this is a step in the right direction, such methods are not able to guarantee fair scoring, as machine learning models are not necessarily unbiased and may discriminate with respect to certain subpopulations such as a particular race, gender, or sexual orientation, even if the variable itself is not used for modeling. This is also true for white box methods like logistic regression. In this study, a framework is presented that allows analyzing and developing models with regard to fairness. The proposed methodology is based on techniques of causal inference, and some of the methods can be linked to methods from explainable machine learning. A definition of counterfactual fairness is given together with an algorithm that results in a fair scoring model. The concepts are illustrated by means of a transparent simulation and a popular real-world example, the German Credit data, using traditional scorecard models based on logistic regression and a weight-of-evidence variable pre-transformation. In contrast to previous studies in the field, a corrected version of the data is presented and used. With the help of the simulation, the trade-off between fairness and predictive accuracy is analyzed. The results indicate that it is possible to remove unfairness without a strong decrease in performance, as long as the correlation of the discriminatory attributes with the other predictor variables in the model is not too strong. In addition, the challenge of explaining the resulting scoring model and the associated fairness implications to users is discussed.
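
To make the modeling setup concrete, the following is a minimal sketch (not the authors' code) of such a traditional scorecard: a weight-of-evidence pre-transformation of a categorical predictor followed by logistic regression. The toy data, column names, and the use of pandas/scikit-learn are illustrative assumptions.

```python
# Minimal scorecard sketch: weight-of-evidence (WoE) pre-transformation
# followed by logistic regression. All names and data are hypothetical.
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression

def woe_map(x: pd.Series, y: pd.Series) -> dict:
    """Map each category of x to its weight of evidence:
    WoE(cat) = ln(P(cat | good) / P(cat | bad)), with y == 0 marking good loans."""
    good = x[y == 0].value_counts(normalize=True)
    bad = x[y == 1].value_counts(normalize=True)
    # Small floor avoids division by zero / log(0) for unseen categories.
    return {cat: np.log(good.get(cat, 1e-6) / bad.get(cat, 1e-6))
            for cat in x.unique()}

# Toy data standing in for a categorical predictor and a 0/1 default target.
df = pd.DataFrame({
    "purpose": ["car", "furniture", "car", "business", "furniture", "car"],
    "credit_risk": [0, 1, 0, 1, 0, 1],
})
df["purpose_woe"] = df["purpose"].map(woe_map(df["purpose"], df["credit_risk"]))
scorecard = LogisticRegression().fit(df[["purpose_woe"]], df["credit_risk"])
```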

Highlights

  • The use of algorithmic scoring methods has been very common in the finance industry for several decades in order to prevent risk and to automate and optimize decisions (Crook et al., 2007)

  • Different definitions of fairness are presented from the credit risk scoring point of view, as well as a fairness correction algorithm based on the concept of counterfactual fairness (see the sketch after this list)

  • The idea of population stability is transferred into a new group unfairness index, which allows quantifying and comparing the degree of group fairness of different scoring models
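
As an illustration of the fairness correction mentioned above, the sketch below shows a residual-based correction that is common in the counterfactual fairness literature; it is in the spirit of, but not necessarily identical to, the paper's algorithm. Each predictor is replaced by its residual after regressing out the protected attribute, so that a counterfactual change of the protected attribute would no longer change the model input. All data and variable names are simulated and hypothetical.

```python
# Hedged sketch of a residual-based counterfactual fairness correction.
import numpy as np
from sklearn.linear_model import LinearRegression, LogisticRegression

rng = np.random.default_rng(0)
n = 1000
protected = rng.integers(0, 2, size=n).astype(float)  # hypothetical binary group label
income = 2.0 * protected + rng.normal(size=n)         # predictor influenced by the group
default = (income + rng.normal(size=n) > 1.0).astype(int)

# Regress the predictor on the protected attribute ...
aux = LinearRegression().fit(protected.reshape(-1, 1), income)
# ... and keep only the residual, i.e., the part of the predictor
# not explained by the protected attribute.
income_fair = income - aux.predict(protected.reshape(-1, 1))

unfair_model = LogisticRegression().fit(income.reshape(-1, 1), default)
fair_model = LogisticRegression().fit(income_fair.reshape(-1, 1), default)
```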


INTRODUCTION

The use of algorithmic scoring methods has been very common in the finance industry for several decades in order to prevent risk and to automate and optimize decisions (Crook et al., 2007). Regulatory requirements as given by the Basel Committee on Banking Supervision (BCBS) (European Banking Authority, 2017) or the EU data protection regulations (Goodman and Flaxman, 2017) have led to increasing interest and research activity in understanding black box machine learning models by means of explainable machine learning (cf., e.g., Bücker et al., 2021). Even though this is a step in the right direction, such methods are not able to guarantee fair scoring, as machine learning models are not necessarily unbiased. In this paper, the idea of population stability is transferred into a new group unfairness index. This allows for a fairness comparison of different models.
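
As a hedged illustration of how such an index could look (the paper's exact definition may differ), the sketch below computes a population-stability-index-style divergence between the score distributions of two protected groups; the binning and smoothing choices are assumptions.

```python
# Sketch of a PSI-style group unfairness index between two score distributions.
import numpy as np

def group_unfairness(scores_a, scores_b, n_bins=10, eps=1e-6):
    """0 means identical score distributions; larger values mean less group fairness."""
    edges = np.unique(np.quantile(np.concatenate([scores_a, scores_b]),
                                  np.linspace(0.0, 1.0, n_bins + 1)))
    p, _ = np.histogram(scores_a, bins=edges)
    q, _ = np.histogram(scores_b, bins=edges)
    p = (p + eps) / (p.sum() + eps * len(p))  # smoothed relative frequencies
    q = (q + eps) / (q.sum() + eps * len(q))
    return float(np.sum((p - q) * np.log(p / q)))

# Example: scores of two groups, simulated with a location shift.
rng = np.random.default_rng(1)
print(group_unfairness(rng.normal(0.0, 1.0, 500), rng.normal(0.5, 1.0, 500)))
```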

