A Novel Method for Credit Scoring Based on Cost-Sensitive Neural Network Ensemble

Wirot Yotsawat,Anongnart Srivihok,Pakaket Wattuya

doi:10.1109/access.2021.3083490

Abstract

Most existing studies on credit scoring adapted a concept of classifier ensemble for solving an imbalanced dataset. They apply resampling methods to generate multiple training subsets for constructing multiple base classifiers. However, this approach leads to several problems that degrade the classification performance, such as problems of information loss, model overfitting, and computational cost. Thus, we propose a novel ensemble approach for developing a credit scoring model based on a cost-sensitive neural network, called Cost-sensitive Neural Network Ensemble (CS-NNE). In the proposed approach, multiple class weights are adapted to original training data, enabling the multiple base neural networks to consider imbalanced classes. Following this approach, a high diversity of multiple base classifiers without consequent problems can be achieved. The approach's effectiveness is evaluated on five real-world credit datasets. Among them is a loan-requesting dataset provided by a financial institution in Thailand. The remaining datasets are publicly available and widely used by several existing studies. The experimental results showed that the proposed CS-NNE approach improves the predictive performance over a single neural network based on imbalanced credit datasets, e.g., Thai credit dataset, by achieving 1.36%, 15.67%, and 6.11% Area under the ROC Curve (AUC), Default Detection Rate (DDR), and G-Mean (GM), respectively, and achieving the best Misclassification Cost (MC). The proposed CS-NNE approach can effectively solve a class of imbalance problems and outperform many existing models. The prediction model can well compromise between classes of default (bad credit applicants) and non-default (good credit applicants), whereas existing approaches preferred a class of non-default over default loans (having high specificity and low DDR), resulting in NPL.

Highlights

A credit scoring model is a statistical analysis tool that determines the creditworthiness of a loan applicant by estimating the probability of default based on historical data [1]
The proposed approach can address the problems in the credit scoring task and improve the performance of credit scoring model
Credit scoring model has become a powerful tool for banks and other financial institutions to assess the creditworthiness of applicants

Summary

INTRODUCTION

A credit scoring model is a statistical analysis tool that determines the creditworthiness of a loan applicant by estimating the probability of default based on historical data [1]. Wei et al [8] combined the outlier removal method and classification algorithm to develop a credit scoring model called backflow learning It was relearned the misclassified data points and combined the prediction of based learners by a two-layer ensemble. By the indirect cost-sensitive methods, in 2018, He et al [5] and Sun et al [7] introduced the idea of generated training subsets using different resampling rates for ensemble classifiers to develop a credit scoring model. Their results were superior to other comparative algorithms. Popular ensemble methods, such as RF [53], XGBoost [54], Bagging [55], and AdaBoost [56], are included

EXPERIMENTS

EXPERIMENTAL RESULTS

RESULTS ON THE THAI CREDIT DATASET

CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2021
Citations: 24	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Novel Method for Credit Scoring Based on Cost-Sensitive Neural Network Ensemble

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Bibliography
-
-
--
23 Dec 2016
23 Dec 2016

Imbalance Learning and Its Application on Medical Datasets
Yachao Shao
-
Yachao ShaoYachao Shao
21 Feb 2022
21 Feb 2022

A new evaluation measure for learning from imbalanced data
Nguyen Thai-Nghe ... Lars Schmidt-Thieme
-
Nguyen Thai-Nghe, et. al.Nguyen Thai-Nghe ... Lars Schmidt-Thieme
01 Jul 2011
01 Jul 2011

ImbTreeEntropy and ImbTreeAUC: Novel R Packages for Decision Tree Learning on the Imbalanced Datasets
Krzysztof Gajowniczek ... Tomasz Ząbkowski
Electronics | VOL. 10
Krzysztof Gajowniczek, et. al.Krzysztof Gajowniczek ... Tomasz Ząbkowski
11 Mar 2021
Electronics | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Method for Credit Scoring Based on Cost-Sensitive Neural Network Ensemble

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access