PUBLIC: A Decision Tree Classifier that Integrates Building and Pruning

Rajeev Rastogi, Kyuseok Shim

doi:10.1023/a:1009887311454

Abstract

Classification is an important problem in data mining. Given a database of records, each with a class label, a classifier generates a concise and meaningful description for each class that can be used to classify subsequent records. A number of popular classifiers construct decision trees to generate class models. These classifiers first build a decision tree and then, in a subsequent pruning phase, prune subtrees from it to improve accuracy and prevent “overfitting”. Generating the decision tree in two distinct phases can waste a substantial amount of effort, since an entire subtree constructed in the first phase may be pruned in the second. In this paper, we propose PUBLIC, an improved decision tree classifier that integrates the second “pruning” phase with the initial “building” phase. In PUBLIC, a node is not expanded during the building phase if it is determined that it will be pruned during the subsequent pruning phase. To make this determination for a node before it is expanded, PUBLIC computes a lower bound on the cost of the minimum cost subtree rooted at the node. PUBLIC then uses this estimate to identify the nodes that are certain to be pruned and does not expend effort on splitting them. Experimental results with real-life as well as synthetic data sets demonstrate the effectiveness of PUBLIC's integrated approach, which can deliver substantial performance improvements.
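
The idea of pruning against a lower bound can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not state the cost function or how the lower bound is computed, so the sketch assumes that each node carries a numeric `leaf_cost` (cost of keeping the node as a leaf) and `split_cost` (cost of describing its split), and that any unexpanded node has a subtree cost of at least a constant `bound`. The names `Node`, `prune`, `leaf_cost`, `split_cost`, `expandable`, and `bound` are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    leaf_cost: float                 # assumed: cost of encoding this node as a leaf
    split_cost: float = 0.0          # assumed: cost of encoding the split at this node
    children: List["Node"] = field(default_factory=list)
    expandable: bool = True          # False once the node is certain to stay a leaf


def prune(node: Node, bound: float = 1.0) -> float:
    """Prune a partially built tree in place.

    Returns a lower bound on the cost of the cheapest subtree rooted at
    `node`. `bound` is an assumed lower bound on the cost of any subtree
    that could still be grown below an unexpanded node.
    """
    if not node.children:
        if node.expandable:
            # Not expanded yet: the final subtree may be cheaper than the
            # leaf, so only the lower bound can be charged here.
            return min(node.leaf_cost, bound)
        return node.leaf_cost

    subtree_cost = node.split_cost + sum(prune(c, bound) for c in node.children)
    if node.leaf_cost <= subtree_cost:
        # Even the most optimistic subtree is no cheaper than a leaf, so the
        # children are certain to be pruned and need not be expanded further.
        node.children = []
        node.expandable = False
        return node.leaf_cost
    return subtree_cost


# Usage: node b is pruned during building, node a stays open for expansion.
a = Node(leaf_cost=2.0)
b = Node(leaf_cost=5.0, split_cost=4.0,
         children=[Node(leaf_cost=3.0), Node(leaf_cost=4.0)])
root = Node(leaf_cost=20.0, split_cost=3.0, children=[a, b])
print(prune(root))      # 9.0
print(b.expandable)     # False
```

Because unexpanded leaves contribute only a lower bound, a node is removed only when keeping it as a leaf is no more expensive than the cheapest subtree it could possibly grow into. Under the stated assumptions, this means the pruning decision would not change if the subtree were fully built first.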

Journal: Data Mining and Knowledge Discovery
Publication Date: Jan 1, 2000
Citations: 128
