The data sampling effect on financial distress prediction by single and ensemble learning techniques

Kuen-Liang Sue, Chih-Fong Tsai, Andy Chiu

doi:10.1080/03610926.2021.1992439

Abstract

Datasets for financial distress prediction are usually class imbalanced. In the literature, data sampling is one of the most widely used solutions to the class imbalance problem. This article examines the effect of data sampling on financial distress prediction models built with single and ensemble learning techniques. The experiments use three bankruptcy prediction and credit scoring datasets, on which twelve different single classifiers and classifier ensembles are constructed. We find that although some prediction models trained on the original class-imbalanced datasets provide reasonable AUC, their type II errors are too high for practical use. However, when data sampling is applied to the datasets, all of the prediction models slightly increase their AUC and greatly reduce their type II errors. In particular, decision tree ensembles built by bagging and boosting are the better choices for financial distress prediction.
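
As a rough illustration of the two ideas the abstract relies on, the sketch below balances a toy dataset by random oversampling and computes a type II error rate. It is not the authors' method: the abstract does not say which sampling techniques were used, so random oversampling is chosen here only as a simple example. The function names `random_oversample` and `type_ii_error` are hypothetical, and the code assumes the distressed class is labeled 1, so that a type II error is a distressed case predicted as healthy.

```python
import random
from collections import Counter


def random_oversample(features, labels, seed=0):
    """Duplicate randomly chosen rows of each smaller class until
    every class has as many rows as the largest class."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_x, out_y = list(features), list(labels)
    for cls, n in counts.items():
        # Rows of this class in the original (unbalanced) data.
        pool = [x for x, y in zip(features, labels) if y == cls]
        for _ in range(target - n):
            out_x.append(rng.choice(pool))
            out_y.append(cls)
    return out_x, out_y


def type_ii_error(y_true, y_pred, positive=1):
    """Share of truly positive cases that were predicted as negative.
    Assumption: 'positive' marks the distressed (minority) class."""
    preds_for_positives = [p for t, p in zip(y_true, y_pred) if t == positive]
    if not preds_for_positives:
        raise ValueError("y_true contains no positive cases")
    missed = sum(1 for p in preds_for_positives if p != positive)
    return missed / len(preds_for_positives)


if __name__ == "__main__":
    # Toy data: 8 healthy firms (0) and 2 distressed firms (1).
    features = [[i] for i in range(10)]
    labels = [0] * 8 + [1] * 2
    balanced_x, balanced_y = random_oversample(features, labels, seed=1)
    print(Counter(balanced_y))
    # One of two distressed firms is missed by these made-up predictions.
    print(type_ii_error([1, 1, 0, 0], [1, 0, 0, 0]))
```

In practice the sampling step would be applied to the training split only, so that duplicated rows never leak into the data used to measure AUC or type II error.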

Journal: Communications in Statistics - Theory and Methods
Publication Date: Oct 13, 2021
Citations: 5

Similar Papers

Predicting financial distress and corporate failure: A review from the state-of-the-art definitions, modeling, sampling, and featuring approaches
Jie Sun ... Kai-Yu He
Knowledge-Based Systems | VOL. 57
13 Dec 2013

Combining feature selection, instance selection, and ensemble classification techniques for improved financial distress prediction
Chih-Fong Tsai ... Andy Chiu
Journal of Business Research | VOL. 130
30 Mar 2021

Dynamic class-imbalanced financial distress prediction based on case-based reasoning integrated with time weighting and resampling
Jie Sun ... Mengru Zhao
Journal of Credit Risk | VOL. -
01 Jan 2023

A novel classifier ensemble approach for financial distress prediction
Deron Liang ... Chih-Fong Tsai
Knowledge and Information Systems | VOL. 54
11 May 2017
