Прогнозирование банкротств предприятий с помощью экстремального градиентного бустинга

Мокеев Владимир Викторович ,Войтецкий Роман Владимирович

doi:10.14529/cmse200305

Abstract

The application of models for forecasting bankruptcy of enterprises for controlling investment is the basis for monitoring activities of financial institutions. A crucial factor in allowing financial institutions to determine the amount of capital to cover credit losses is the accuracy of the forecast. Most studies use traditional statistical methods (for example, linear discriminant analysis and logistic regression) to build models of enterprise bankruptcy forecasting, but the accuracy of these models is usually quite low. The reason for that is the imbalanced nature of training data sets (the share of bankrupt firms is a small percent of the total number of firms). Nowadays, such machine learning methods as the random forest and the gradient boosting are becoming widespread. This study focuses on the use of extreme gradient boosting to predict bankruptcy. Extreme gradient boosting, using LASSO or Ridge regularization, penalizes complex models to avoid overfitting. Also, during training, extreme gradient boosting fills in the missing values of the data set, depending on the value of the loss. In this article, we proposed SMOTE technique to enhance the minority class of the training data sets, which helps to improve the performance of extreme gradient boosting. The experiment results of improved extreme gradient boosting are compared to the outcomes obtained by other methods.

Highlights

Инвестиционные риски являются основной проблемой для финансовых учреждений, что заставляет их проверять и контролировать финансовую платежеспособность предприятия
This study focuses on the use of extreme gradient boosting to predict bankruptcy
We proposed Synthetic Minority Oversampling Technique (SMOTE) technique to enhance the minority class of the training data sets, which helps to improve the performance of extreme gradient boosting

Summary

Экстремальный градиентный бустинг

Экстремальный градиентный бустинг (Extreme Gradient Boosting, XGB) представляет развитие метода градиентного бустинга [20, 21]. Также в ходе обучения экстремальный градиентный бустинг использует алгоритм заполнения пропущенных значений в зависимости от величины потерь. Увеличение числа деревьев также повышает сложность модели, но при этом появляется возможность повысить точность получаемых решений. Для повышения качества прогнозирования банкротств предлагается использовать при построении моделей методом экстремального градиентного бустинга метод улучшения сбалансированности обучающей выборки. Для создания нового образца находят разность d = XD − XF, где XF, XD — это векторы признаков «соседних» образцов a и b класса предприятий банкротов. Для обучения модели используется процедура кросс-валидации, в рамках которой набор делится на К блоков (folds). Валидационный набор используется для оценки качества обучения. Метод SMOTE органично встраивается в схему обучения моделей методом экстремального градиентного бустинга и может представлять улучшенную версию экстремального градиентного бустинга

Метрика качества

Набор данных

Предварительная обработка данных

Findings

Discussion

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Прогнозирование банкротств предприятий с помощью экстремального градиентного бустинга

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Bulletin of the South Ural State University. Series "Computational Mathematics and Software Engineering"

Lead the way for us

Journal: Bulletin of the South Ural State University. Series "Computational Mathematics and Software Engineering"	Publication Date: Aug 1, 2020
License type: cc-by

Similar Papers

Artificial intelligence based system for predicting permanent stoma after sphincter saving operations
Chih-Yu Kuo ... Yen‑Kuang Lin
Scientific Reports | VOL. 13
Chih-Yu Kuo, et. al.Chih-Yu Kuo ... Yen‑Kuang Lin
25 Sep 2023
Scientific Reports | VOL. 13

Methodological progress note: Machine learning methods in healthcare research.
Colin Rogerson ... Matt Hall
Journal of Hospital Medicine | VOL. 18
Colin Rogerson, et. al.Colin Rogerson ... Matt Hall
13 Mar 2023
Journal of Hospital Medicine | VOL. 18

Machine Learning Approaches for Predicting Hypertension and Its Associated Factors Using Population-Level Data From Three South Asian Countries
Sheikh Mohammed Shariful Islam ... Liliana Laranjo
Frontiers in Cardiovascular Medicine | VOL. 9
Sheikh Mohammed Shariful Islam, et. al.Sheikh Mohammed Shariful Islam ... Liliana Laranjo
31 Mar 2022
Frontiers in Cardiovascular Medicine | VOL. 9

Performance Evaluation of Machine Learning Approaches for Credit Scoring
...
International Journal of Economics, Finance and Management Sciences | VOL. 6
, et. al. ...
12 Dec 2018
International Journal of Economics, Finance and Management Sciences | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Прогнозирование банкротств предприятий с помощью экстремального градиентного бустинга

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Bulletin of the South Ural State University. Series "Computational Mathematics and Software Engineering"