Interpretable high-stakes decision support system for credit default forecasting

Weixin Sun,Xuantao Zhang,Minghao Li,Yong Wang

doi:10.1016/j.techfore.2023.122825

Abstract

Methods for forecasting credit default have long been research focus for financial institutions. In this study, we propose an interpretable high-stakes decision support system for credit default forecasting called CDFS. Because of the high-stake nature of credit default prediction, the proposed CDFS adheres to the principle of "people in the loop." The proposed CDFS comprises six modules: data processing, feature selection, data balancing, forecasting, evaluation, and interpretation. A feature selection method (permutation importance method), nine resampling methods, and six high-performance forecasting methods were employed in the proposed CDFS. The China Taiwan credit card default dataset and South Germany credit dataset were used to test the interpretability and predictive performance of the proposed CDFS. Experiments showed that the feature selection and data balancing modules of the CDFS effectively improve the prediction performance. A comparison with traditional logistic regression models demonstrated that the CDFS can provide decision-makers with satisfactory explanations for prediction results. In summary, the CDFS proposed in this study exhibited excellent predictive performance and satisfactory interpretability. This study contributes to improving the accuracy of credit default forecasting and reducing credit risk in financial institutions.

Full Text