Abstract

C5.0 classifier is optimized using Bayesian theory. The C5.0 algorithm is a classifier that discovers patterns in data and uses them to make accurate predictions. It classifies data objects based on the information gain of its attributes. Though it responds to noisy and missing data, its accuracy can be improved upon. This research work proposes a post pruning decision tree algorithm that will use C5.0 as its base and Bayesian posterior theory as an enhancer. Post pruning is performed by evaluating the decision tree using Bayesian posterior theory. Bayesian theory uses probability to judge the relative validity of hypothesis in terms of noisy and uncertain data. The proposed algorithm attempts to support low memory usage, higher accuracy and improved speed with the help of smaller decision trees. It will also reduce the risks associated with over-fitting.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.