Explaining Deep Learning Models for Credit Scoring with SHAP: A Case Study Using Open Banking Data

Lars Ole Hjelkrem,Petter Eilif De Lange

doi:10.3390/jrfm16040221

Lars Ole Hjelkrem, Petter Eilif De Lange

Open Access

https://doi.org/10.3390/jrfm16040221

Copy DOI

Abstract

Predicting creditworthiness is an important task in the banking industry, as it allows banks to make informed lending decisions and manage risk. In this paper, we investigate the performance of two different deep learning credit scoring models developed on the textual descriptions of customer transactions available from open banking APIs. The first model is a deep learning model trained from scratch, while the second model uses transfer learning with a multilingual BERT model. We evaluate the predictive performance of these models using the area under the receiver operating characteristic curve (AUC) and Brier score. We find that a deep learning model trained from scratch outperforms a BERT transformer model finetuned on the same data. Furthermore, we find that SHAP can be used to explain such models both on a global level and for explaining rejections of actual applications.

Full Text