Abstract
The performance of credit scoring models can be compromised when dealing with imbalanced datasets, where the number of defaulted borrowers is significantly lower than that of non-defaulters. To address this challenge, we propose a gradient boosting decision tree model based on the generalised extreme value distribution (GEV-GBDT). Our approach replaces the conventional symmetric logistic sigmoid function with the asymmetric cumulative distribution function of the GEV distribution as the activation function. We derive a novel loss function based on the maximum likelihood estimation of the GEV distribution within the boosting framework. This modification allows the model to focus more on the minority class by emphasising the tail of the response curve, and the shape parameter of the GEV distribution offers flexibility in controlling the model's emphasis on minority samples. We examine the performance of this approach using four real-life loan datasets. The empirical results show that the GEV-GBDT model achieves superior classification performance compared to other commonly used imbalanced learning methods, including the synthetic minority oversampling technique and the cost-sensitive framework. Furthermore, we conduct performance tests on several datasets with varying imbalance ratios and find that the advantage of GEV-GBDT is most pronounced on extremely imbalanced datasets.
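To make the idea concrete, the sketch below illustrates how an asymmetric GEV link can be combined with gradient boosting. It is not the paper's algorithm: the paper derives an analytic loss and its derivatives within its own boosting framework, whereas this example plugs a GEV-link negative log-likelihood into XGBoost's custom-objective interface, approximates the Hessian by a finite difference of the gradient, and uses an illustrative shape parameter `XI`. The names `gev_cdf`, `gev_objective`, and the chosen value of `XI` are assumptions made for illustration only.

```python
import numpy as np
import xgboost as xgb

XI = -0.25    # hypothetical GEV shape parameter; the paper tunes this to control emphasis on the minority class
EPS = 1e-10

def gev_cdf(z, xi=XI):
    """GEV cumulative distribution function used as the asymmetric link:
    F(z) = exp(-(1 + xi * z) ** (-1 / xi)) on the support 1 + xi * z > 0."""
    s = np.clip(1.0 + xi * z, EPS, None)          # keep the argument inside the support
    return np.exp(-s ** (-1.0 / xi))

def gev_objective(preds, dtrain):
    """Custom XGBoost objective: negative log-likelihood of binary labels
    under the GEV link. The gradient is analytic; the Hessian is a central
    finite difference of the gradient (a simplification of the paper's derivation)."""
    y = dtrain.get_label()

    def grad_at(z):
        p = np.clip(gev_cdf(z), EPS, 1.0 - EPS)   # predicted default probability
        s = np.clip(1.0 + XI * z, EPS, None)
        dp_dz = p * s ** (-1.0 / XI - 1.0)        # derivative of the GEV CDF w.r.t. the raw score
        return -(y / p - (1.0 - y) / (1.0 - p)) * dp_dz

    h = 1e-4
    grad = grad_at(preds)
    hess = (grad_at(preds + h) - grad_at(preds - h)) / (2.0 * h)
    hess = np.maximum(hess, EPS)                  # keep the Hessian positive for a stable boosting step
    return grad, hess

# Usage sketch on an imbalanced binary dataset (X, y with y in {0, 1}):
# dtrain = xgb.DMatrix(X, label=y)
# booster = xgb.train({"max_depth": 3, "eta": 0.1}, dtrain,
#                     num_boost_round=200, obj=gev_objective)
# probs = gev_cdf(booster.predict(dtrain))        # map raw scores through the GEV link
```

Because the GEV CDF is asymmetric, a negative shape parameter stretches one tail of the response curve, so misclassified minority (default) cases incur steeper gradients than under a symmetric logistic link, which is the mechanism the abstract describes.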