Abstract

This work proposes a data mining algorithm called Unordered Rule Sets using a continuous Ant-Miner algorithm. The goal of this work is to extract classification rules from data. Swarm intelligence (SI) is a technique whereby rules may be discovered through the study of collective behavior in decentralized, self-organized systems, such as ants. The Ant-Miner algorithm, first proposed by Parpinelli and his colleagues (2002), applies an ant colony optimization (ACO) heuristic to the classification task of data mining to discover an ordered list of classification rules. Ant-Miner is a rule-induction algorithm that uses SI techniques to form rules. Ant-Miner uses a discretization process to deal with continuous attributes in the data. Discretization transforms numeric attributes into nominal attributes. Discretization may suffer from a loss of information, as the real relationship underlying individual values of a numeric attribute is unknown. The objective of this work is to apply ACO heuristic techniques to discover unordered rule sets for mixed variables in a data set. The proposed algorithm handles both nominal and continuous attributes using multimodal functions. It has the advantage of discovering more modular rules, i.e., rules that can be interpreted independently from other rules - unlike the rules in an ordered list, where the interpretation of a rule requires knowledge of the previous rules in the list. The results provide evidence that the accuracy of the Unordered Rule Set Continuous Ant-Miner algorithm is competitive with other Ant-Miner versions and generates simpler rule sets.

Highlights

  • Data mining refers to the extraction of knowledge from large amounts of data

  • We have evaluated the performance of the algorithm with the existing Ordered and Unordered Rule Set Ant-Miner algorithms

  • We considered the original Ant-Miner and Unordered Rule Set Ant-Miner values as benchmark values

Read more

Summary

Introduction

Data mining refers to the extraction of knowledge from large amounts of data. Classification is one of the most frequently occurring tasks of human decision-making. Rule discovery is an important data mining task, as it generates a set of symbolic rules that describe each class or category in a natural way. These rules need to be simple and comprehensive; otherwise, a human is unable to comprehend them. To our knowledge, Parpinelli, Lopes, and Freitas (2002) were the first to propose Ant Colony Optimization (ACO) for discovering classification rules using the system Ant-Miner. They argue that an ant-based search is more flexible and robust than traditional approaches. We handle continuous attributes directly using multimodal functions and generate unordered rule sets

Objectives
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call