Incremental Ensemble Learning Model for Imbalanced Data: a Case Study of Credit Scoring

My Thi Thien Bui

doi:10.55579/jaec.202372.407

Abstract

Imbalanced data is a challenge for classification models. It reduces the overall performance of traditional learning algorithms. Besides, the minority class of imbalanced datasets is misclassified with a high ratio even though this is a crucial object of the classification process. In this paper, a new model called the Lasso-Logistic ensemble is proposed to deal with imbalanced data by utilizing two popular techniques, random over-sampling and random under-sampling. The model was applied to two real imbalanced credit data sets. The results show that the Lasso-Logistic ensemble model oﬀers better performance than the single traditional methods, such as random over-sampling, random under-sampling, Synthetic Minority Oversampling Technique (SMOTE), and cost-sensitive learning.This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium provided the original work is properly cited.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Advanced Engineering and Computation	Publication Date: Jun 30, 2023
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Incremental Ensemble Learning Model for Imbalanced Data: a Case Study of Credit Scoring

Abstract

Talk to us

Similar Papers

More From: Journal of Advanced Engineering and Computation

Lead the way for us

Similar Papers

Stroke Prediction with Machine Learning Methods among Older Chinese.
Yafei Wu ... Ya Fang
International Journal of Environmental Research and Public Health | VOL. 17
Yafei Wu, et. al.Yafei Wu ... Ya Fang
01 Mar 2020
International Journal of Environmental Research and Public Health | VOL. 17

Addressing the Big Data Multi-class Imbalance Problem with Oversampling and Deep Learning Neural Networks
V M González-Barcenas ... R M Valdovinos
-
V M González-Barcenas, et. al.V M González-Barcenas ... R M Valdovinos
01 Jan 2019
01 Jan 2019

Automated semiconductor wafer defect classification dealing with imbalanced data
Po-Hsuan Lee ... Wei Fang
-
Po-Hsuan Lee, et. al.Po-Hsuan Lee ... Wei Fang
20 Mar 2020
20 Mar 2020

Algorithms efficiency measurement on imbalanced data using geometric mean and cross validation
Mustakim Al Helal ... Mohammad Salman Haydar
-
Mustakim Al Helal, et. al.Mustakim Al Helal ... Mohammad Salman Haydar
01 Dec 2016
01 Dec 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Incremental Ensemble Learning Model for Imbalanced Data: a Case Study of Credit Scoring

Abstract

Talk to us

Similar Papers

More From: Journal of Advanced Engineering and Computation