Abstract

This paper introduces a novel classification rule mining model based on Pareto-based Multiobjective Optimization called CRM-PM. The process of rule extraction is a challenging classification task in data mining since it has several constraints and conflicting objectives such as accuracy and comprehensibility. In this study, this task is accepted as a multi-objective optimization problem. Classification accuracy and misclassification ratio are assigned as evaluation criteria. The candidate solutions are generated in the direction of a proposed strategy to determine optimal ranges of the attributes that form the rules. The proposed approach is applied on eight benchmark datasets (Iris Plants, Wine Quality, Glass Identification, Stat log (Heart), Haberman’s Survival, E-coli, Wisconsin Breast Cancer, and Pima Indians Diabetes) included in the University of California at Irvine machine learning repository. Furthermore, CRM-PM is run in three different validation modes: cross-validation, training without test data, and training with random splitting. Regarding experimental results, it can be said that the presented method has a promising capability for classification, and it achieves comparative or superior results.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call