Decision Tree Application to Classification Problems with Boosting Algorithm

Long Zhao, Sanghyuk Lee, Seon-Phil Jeong

doi:10.3390/electronics10161903

Open Access

https://doi.org/10.3390/electronics10161903

Abstract

A personal credit evaluation algorithm is proposed that combines a decision tree with a boosting algorithm, and classification is carried out with it. A comparison with the conventional decision tree algorithm shows that the boosting algorithm acts to speed up the processing time. The Classification and Regression Tree (CART) algorithm with boosting reached 90.95% accuracy, slightly higher than the 90.31% obtained without boosting. To avoid overfitting the model to the training set because of an unreasonable division of the data set, we use cross-validation and illustrate the results with simulation; hyperparameters of the model are applied and the model fitting effect is verified. The proposed decision tree model is fitted optimally with the help of a confusion matrix. Relevant evaluation indicators are also introduced to assess the performance of the proposed model. For comparison with conventional methods, the accuracy rate, error rate, precision, recall, etc. are reported, and we comprehensively evaluate model performance based on the accuracy after 10-fold cross-validation. The results show that, when CART is applied, the boosting algorithm improves the accuracy and precision of the model, but model fitting takes much longer, around 2 min. These results verify that the performance of the decision tree model improves under the boosting algorithm. We also test the performance of the proposed verification model with model fitting, and it could be applied as a prediction model for customers’ decisions on subscribing to the fixed deposit business.
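The evaluation indicators named in the abstract (accuracy rate, error rate, precision, recall) can all be read off a binary confusion matrix, and 10-fold cross-validation amounts to rotating which tenth of the data is held out. The following is a minimal Python sketch under stated assumptions: labels are binary with 1 as the positive class, folds are contiguous and unshuffled, and the function names `confusion_counts`, `evaluate`, and `k_fold_indices` are hypothetical helpers, not taken from the paper.

```python
from typing import Dict, List, Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn) for binary labels, assuming 1 is the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def evaluate(tp: int, fp: int, fn: int, tn: int) -> Dict[str, float]:
    """Compute the four indicators from confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    return {
        "accuracy": accuracy,
        "error_rate": 1.0 - accuracy,
        # Guard against division by zero when a class is never predicted or never present.
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


def k_fold_indices(n_samples: int, k: int = 10) -> List[Tuple[List[int], List[int]]]:
    """Split range(n_samples) into k contiguous folds and return (train, test) index lists."""
    # The first (n_samples % k) folds receive one extra sample.
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds = []
    start = 0
    for size in sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        folds.append((train, test))
        start += size
    return folds


# Illustrative labels only; these are not data from the paper.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 0, 1, 0, 1]
metrics = evaluate(*confusion_counts(y_true, y_pred))
folds = k_fold_indices(25, k=10)
```

In practice the cross-validated accuracy would be the mean of the per-fold accuracies, with the model refitted on each training split; shuffling or stratifying the indices before splitting is a common refinement that this sketch leaves out.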

Highlights

As a method for approximating classification functions, the decision tree was developed in the field of machine learning [1].
The decision tree algorithm gradually developed into a family of algorithms, such as the Iterative Dichotomiser 3 (ID3), C4.5, C5.0, and Classification and Regression Tree (CART) algorithms [6].
The algorithms used in this paper are the C5.0 and CART algorithms, both of which evolved from earlier algorithms and offer improved overall performance [6].

Summary

Introduction

As a method for approximating classification functions, the decision tree was developed in the field of machine learning [1]. The concept learning system proposed by Hunt et al. is the earliest decision tree algorithm [5]. The decision tree algorithm gradually developed into a family of algorithms, such as the Iterative Dichotomiser 3 (ID3), C4.5, C5.0, and Classification and Regression Tree (CART) algorithms [6]. The C5.0 algorithm is an intuitive and efficient classification method, but its information gain rate is complex to calculate, and it is prone to overfitting and decision tree bias. To solve these problems, the calculation of the information gain rate is simplified by formula transformation. A classifier ensemble was proposed to enhance diversity, and it provided a near-optimal classifying system [8,9].
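To make the "information gain rate" concrete, here is a minimal Python sketch of the textbook gain ratio used as a split criterion in C4.5 (information gain divided by the split information of the feature). It assumes a single categorical feature, and the function names are hypothetical. It is the standard definition only, and it does not reproduce the simplified calculation by formula transformation mentioned above, which this summary does not spell out.

```python
import math
from collections import Counter
from typing import Dict, Hashable, List, Sequence


def entropy(values: Sequence[Hashable]) -> float:
    """Shannon entropy (in bits) of the empirical distribution of values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def gain_ratio(feature_values: Sequence[Hashable], labels: Sequence[Hashable]) -> float:
    """Information gain of splitting on a categorical feature, divided by its split information."""
    n = len(labels)
    # Group the class labels by the feature value of each sample.
    groups: Dict[Hashable, List[Hashable]] = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted entropy of the labels after the split.
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - conditional
    # Split information penalises features with many distinct values.
    split_info = entropy(feature_values)
    return info_gain / split_info if split_info > 0 else 0.0


# Illustrative toy data only; these are not data from the paper.
labels = [1, 1, 0, 0]
perfect_feature = ["a", "a", "b", "b"]   # separates the classes completely
useless_feature = ["a", "b", "a", "b"]   # carries no class information
```

A feature that separates the classes completely reaches the maximum ratio for a balanced two-way split, while a feature unrelated to the class scores zero; a tree builder would pick the candidate feature with the largest ratio at each node.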

Journal: Electronics	Publication Date: Aug 8, 2021
Citations: 25	License type: CC BY 4.0

Similar Papers

HYPER HEURISTIC EVOLUTIONARY APPROACH FOR CONSTRUCTING DECISION TREE CLASSIFIERS
Sunil Kumar ... Saroj Ratnoo
Journal of Information and Communication Technology | VOL. 20
01 Jan 2020

Construction and validation of a decision tree based on biomarkers for predicting severe acute kidney injury in critically ill patients
Ruibin Chi ... Zhigang Jian
Zhonghua wei zhong bing ji jiu yi xue | VOL. 32
01 Jun 2020

Detection of subclinical keratoconus using a novel combined tomographic and biomechanical model based on an automated decision tree
Peng Song ... Pei Li
Scientific Reports | VOL. 12
29 Mar 2022

Vehicle classification with single multi-functional magnetic sensor and optimal MNS-based CART
Haijian Li ... Moyu Ren
Measurement | VOL. 55
04 May 2014
