Abstract

Since the decision trees (DTs) have an advantage over “black-box” models, such as neural nets or support vector machines, in terms of comprehensibility, such that it might merit improvement for further optimization. The node splitting measures and pruning methods are primary among the techniques that can improve the generalization abilities of DTs. Here, we introduced the unequal interval optimization for node splitting, as well as the local chi-square test for tree pruning. This new method was named an adaptive multi-branch decision tree (CMDT). 11 benchmark data sets with different scales were chosen from UCI Machine Learning Repository and coupled with 12 classifiers to evaluate the CMDT algorithm. The results showed that CMDT can be more reliable than the twelve comparative approaches, especially for imbalanced datasets. We also discussed the performance metrics and the weighted decision-making table in unbalanced data sets. The CMDT algorithm can be found here: https://github.com/chenyuan0510/CMDT .

Highlights

  • With the advancement of science and technology, machine learning has been widely employed for classification and recognition tasks in many domains [1]

  • Ke et al proposed an improved Gradient Boosting Decision Tree (GBDT) called LightGBM that could speed up the training process of conventional GBDT by up to over 20 times while achieving almost the same accuracy [6]

  • We proposed a Chi-maximal information coefficient (MIC)-based adaptive multibranch decision tree (CMDT), which can handle mixedtype attributes

Read more

Summary

A Chi-MIC Based Adaptive Multi-branch Decision Tree

Jiahao Ye1, Jingjing Yang, Jiang Yu1, Siqiao Tan, Feng Luo, Zheming Yuan and Yuan Chen.

INTRODUCTION
BACKGROUND
METHODS AND MATERIALS
GROWING OF THE TREE
WEIGHTED DECISION-MAKING FOR IMBALANCED
Comparison of independent prediction accuracy among different models
DISCUSSIONS
Findings
CONCULASION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call