Abstract

An ideal model for credit risk assessment is supposed to select important features and process imbalanced data sets in an effective manner. This paper proposes an integrated method that combines B&B (branch and bound)-based hybrid feature selection (BBHFS) with the imbalanceoriented multiple-classifier ensemble (IOMCE) for imbalanced credit risk assessment and uses the support vector machine (SVM) and the multiple discriminant analysis (MDA) as the base predictor. BBHFS is a hybrid feature selection method that integrates the t-test and B&B with the k-fold crossvalidation method to search for a satisfactory feature subset. The IOMCE divides majority samples into several subsets and then combines them with minority samples to construct several training sets for constructing a multiple-classifier ensemble model. We conduct main experiments using a 1:3 imbalanced corporate credit risk data set with continuous features and extended experiments using a 1:5 imbalanced data set with continuous features and a 1:3 imbalanced data set with discrete and nominal features. We combine no feature selection and five feature selection methods (the pure B&B, the factor analysis, the pure t-test, t-test & correlation analysis, and BBHFS) with single-classifier and the IOMCE to construct SVM and MDA models for an empirical comparison. When all features are continuous, the BBHFS-IOMCE method generally outperforms all the other methods. More specifically, BBHFS provides more stable and satisfactory results than the other feature selection methods, and compared with single-classifier models, IOMCE models can significantly enhance the recognition rate for minority samples while incurring a small reduction in the recognition rate for majority samples and maintaining an acceptable overall accuracy. When the features are almost discrete or nominal, the IOMCE method retains its ability to deal with an imbalanced data set, although the five feature selection methods have no significant advantages over no feature selection. This suggests that BBHFS is effective in retaining useful information when reducing the dimensionality of continuous features and that the BBHFS-IOMCE method is an important tool for imbalanced credit risk assessment.

Highlights

  • To prevent new financial crises derived from credit crises, commercial banks as well as domain researchers place great emphasis on risk management, credit risk management

  • We propose a credit risk assessment model based on BBHFS-imbalanceoriented multiple-classifier ensemble (IOMCE) by comprehensively considering the following: 1) the importance of optimizing feature selection in credit risk modeling; 2) the importance of determining an appropriate feature extraction rate; and 3) the reality that a credit data set typically consists of imbalanced samples

  • This paper proposes an integrated credit risk assessment method that combines BBHFS with the IOMCE and uses the support vector machine (SVM) and the multiple discriminant analysis (MDA) as the base predictor

Read more

Summary

Introduction

To prevent new financial crises derived from credit crises, commercial banks as well as domain researchers place great emphasis on risk management, credit risk management In this regard, the assessment of credit risk is a important issue. This study addresses three issues that have received little attention from previous research on credit risk assessment: the hybrid optimization of feature selection, the optimization of the feature extraction rate, and imbalanced modeling. A credit data set is typically imbalanced, which implies that the number of samples with good credit often exceeds that of samples with bad credit In this case, the results for the overall accuracy of a credit risk model trained and tested using imbalanced data sets may overestimate its performance.

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call