Abstract

Credit scoring is an important area for financial risk management, where a small improvement in prediction performance will save significant losses for financial institutions. This article proposes a two-stage framework for credit scoring based on feature augmentation and dimension reduction to improve the model performance. In this framework, the logarithm marginal density ratios transformation is employed to provide more prominent features for credit scoring, fully utilizing original data information. For the dimension reduction process, we apply three classifiers for penalized logistic regression (add a sum of the parameter absolute value to the criterion function), extreme gradient boosting with regularization term, and k-nearest neighbor with sequential forward selection to two different credit data sets. The results indicate that this two-stage framework can improve the performance of credit scoring models.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call