Bayesian Additive Regression Trees for Classification of Unbalanced Class of Credit Collectability Data

Hafizh Iman Naufal, Eni Sumarminingsih, Achmad Efendi

doi:10.9734/ajpas/2023/v23i1494

Abstract

Aims: This study examines the classification results of the Bayesian Additive Regression Trees (BART) method on bank credit collectability data in which the classes are imbalanced.

Study Design: Quantitative design.

Place and Duration of Study: The data are secondary data on bank debtors' credit collectability, with nine predictor variables and one response variable (credit collectability). They were collected from banks in East Java, Indonesia, from 01 May 1986 to 31 May 2018.

Methodology: The Bayesian approach is an increasingly popular estimation method in statistics, because rapid advances in technology mean its computational demands are no longer a major obstacle. Bayesian estimation continues to develop and can be applied in a variety of statistical methods, for both regression and classification. Classification and Regression Trees (CART) is one of the most widely used classification methods. In a bank, debtors with delinquent credit make up a small proportion compared with debtors with current credit. Standard classifiers such as CART are not well suited to this case, because CART is sensitive to class imbalance and tends to favor the majority class. Hence, ensemble methods such as BART are needed to improve classification accuracy when the classes are imbalanced.

Results: Cross-validation of the BART model shows highly consistent classification accuracy, at 83.49%. This indicates that BART can perform consistently even when the classes are imbalanced. The classification accuracy is 84.53% on the training data and 85.48% on the testing data. These results also suggest that BART is able to avoid overfitting, which often affects classification methods with very strong classification ability.

Conclusion: The accuracy on the testing data is close to that on the training data, which indicates that the BART method has captured the patterns in the data.
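The concern about class imbalance can be illustrated with a small sketch. It is not the authors' procedure and does not use the study's data: the 85/15 class split, the labels, and the helper names `accuracy` and `per_class_recall` are all assumptions chosen for illustration. It shows why overall accuracy is usually read alongside per-class results when one class is rare: a classifier that always predicts the majority class still scores high accuracy while missing every delinquent debtor.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of observations whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_class_recall(y_true, y_pred):
    """For each true class, the fraction of its observations predicted correctly."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {label: hits[label] / totals[label] for label in totals}


# Hypothetical labels (not the study's data): 0 = current credit, 1 = delinquent credit.
y_true = [0] * 85 + [1] * 15

# A trivial classifier that always predicts the majority class.
y_majority = [0] * len(y_true)

acc = accuracy(y_true, y_majority)
recalls = per_class_recall(y_true, y_majority)
balanced_accuracy = sum(recalls.values()) / len(recalls)

print(f"accuracy          = {acc:.2f}")
print(f"per-class recall  = {recalls}")
print(f"balanced accuracy = {balanced_accuracy:.2f}")
```

In this hypothetical case the accuracy is 0.85 even though recall on the delinquent class is 0, so the mean of the per-class recalls (balanced accuracy) is only 0.5.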
