Credit Risk Classification Research Articles

For the financial health of lenders and institutions, one important risk assessment called credit risk is about correctly deciding whether or not a borrower will fail to repay a loan. It not only helps in the approval or denial of loan applications but also aids in managing the non-performing loan (NPL) trend. In this study, a dataset provided by the LendingClub company based in San Francisco, CA, USA, from 2007 to 2020 consisting of 2,925,492 records and 141 attributes was experimented with. The loan status was categorized as “Good” or “Risk”. To yield highly effective results of credit risk prediction, experiments on credit risk prediction were performed using three widely adopted supervised machine learning techniques: logistic regression, random forest, and gradient boosting. In addition, to solve the imbalanced data problem, three sampling algorithms, including under-sampling, over-sampling, and combined sampling, were employed. The results show that the gradient boosting technique achieves nearly perfect Accuracy, Precision, Recall, and F1score values, which are better than 99.92%, but its MCC values are greater than 99.77%. Three imbalanced data handling approaches can enhance the model performance of models trained by three algorithms. Moreover, the experiment of reducing the number of features based on mutual information calculation revealed slightly decreasing performance for 50 data features with Accuracy values greater than 99.86%. For 25 data features, which is the smallest size, the random forest supervised model yielded 99.15% Accuracy. Both sampling strategies and feature selection help to improve the supervised model for accurately predicting credit risk, which may be beneficial in the lending business.

In this paper, a dual-voting-based learning paradigm is proposed to solve attribute noise problem in credit risk classification. In the proposed learning paradigm, three stages are involved. In the first stage, four indexes are introduced to evaluate the noise level of attributes. In the second stage, attributes with different noise levels are divided into different attribute sets in accordance with the dual-voting results of noise level. In the third stage, credit datasets with different attributes sets are dealt with different learning strategies and different de-noising methods for comparison purpose. In the proposed learning paradigm, a classification and regression tree (CART) model is adopted as the generic classifier to evaluate the performance on training datasets generated by different learning strategies and noise reduction methods. In addition, the performance of all learning strategies on sparse data with attribute noise is also discussed. Experimental results show that the proposed learning paradigm performs better than the benchmarks to solve the attribute noise problem not only in accuracy and its stability, but also in speediness. Further analysis indicates that the sparse data with attribute noise can further improve the stability of accuracy for a specific de-noising method. This implies that the proposed dual voting-based learning paradigm is a promising solution to attribute noise reduction in credit risk classification.

Credit Risk Classification Research Articles

Related Topics

Articles published on Credit Risk Classification

Credit Risk Classification and Prediction Based on Deep Neural Network Algorithm

Credit Risk Classification Prediction Based on Optimised Adaboost Algorithm with Long Short-Term Memory Neural Network (LSTM)

A SC Financial Credit Risk Assessment Model Based on Particle Filter and SVM with Gain Information

Semi-supervised heterogeneous domain adaptation for few-sample credit risk classification

Enhancing banking governance: A machine learning-based credit risk classification

Enhancing Supervised Model Performance in Credit Risk Classification Using Sampling Strategies and Feature Ranking

A shapelet-based behavioral pattern extraction method for credit risk classification with behavior sparsity

IMPROVED SUPPORT VECTOR MACHINE PERFORMANCE USING PARTICLE SWARM OPTIMIZATION IN CREDIT RISK CLASSIFICATION

Forecasting financial markets and credit risk classification using genetic folding algorithm

Forecasting financial markets and credit risk classification using genetic folding algorithm

A Gaussian process‐based approach toward credit risk modeling using stationary activations

Credit risk classification: an integrated predictive accuracy algorithm using artificial and deep neural networks

Evaluation of SMEs’ Credit Decision Based on Support Vector Machine-Logistics Regression

Missing Data Preprocessing in Credit Classification: One-Hot Encoding or Imputation?

Can machine learning paradigm improve attribute noise problem in credit risk classification?

Domain Adaptation Learning Based on Structural Similarity Weighted Mean Discrepancy for Credit Risk Classification

A VNS-EDA Algorithm-Based Feature Selection for Credit Risk Classification

Credit Risk Classification in Peer-to-Peer Marketplaces: The Nexus of Neural Network Approach

A DBN-based resampling SVM ensemble learning paradigm for credit classification with imbalanced data

Neural Networks in Credit Risk Classification of Companies in the Construction Sector

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Credit Risk Classification Research Articles

Related Topics

Articles published on Credit Risk Classification

Credit Risk Classification and Prediction Based on Deep Neural Network Algorithm

Credit Risk Classification Prediction Based on Optimised Adaboost Algorithm with Long Short-Term Memory Neural Network (LSTM)

A SC Financial Credit Risk Assessment Model Based on Particle Filter and SVM with Gain Information

Semi-supervised heterogeneous domain adaptation for few-sample credit risk classification

Enhancing banking governance: A machine learning-based credit risk classification

Enhancing Supervised Model Performance in Credit Risk Classification Using Sampling Strategies and Feature Ranking

A shapelet-based behavioral pattern extraction method for credit risk classification with behavior sparsity

IMPROVED SUPPORT VECTOR MACHINE PERFORMANCE USING PARTICLE SWARM OPTIMIZATION IN CREDIT RISK CLASSIFICATION

Forecasting financial markets and credit risk classification using genetic folding algorithm

Forecasting financial markets and credit risk classification using genetic folding algorithm

A Gaussian process‐based approach toward credit risk modeling using stationary activations

Credit risk classification: an integrated predictive accuracy algorithm using artificial and deep neural networks

Evaluation of SMEs’ Credit Decision Based on Support Vector Machine-Logistics Regression

Missing Data Preprocessing in Credit Classification: One-Hot Encoding or Imputation?

Can machine learning paradigm improve attribute noise problem in credit risk classification?

Domain Adaptation Learning Based on Structural Similarity Weighted Mean Discrepancy for Credit Risk Classification

A VNS-EDA Algorithm-Based Feature Selection for Credit Risk Classification

Credit Risk Classification in Peer-to-Peer Marketplaces: The Nexus of Neural Network Approach

A DBN-based resampling SVM ensemble learning paradigm for credit classification with imbalanced data

Neural Networks in Credit Risk Classification of Companies in the Construction Sector