Abstract

Bat algorithm is one of the optimization techniques that mimic the behavior of bat. Bat algorithm is a powerful algorithm in finding the optimum feature data collection. Classification is one of the data mining tasks that useful in knowledge representation. But, the high dimensional data become the issue in the classification that interrupt classification accuracy. From the literature, feature selection and discretization able to overcome the problem. Therefore, this study aims to show Bat algorithm is potential as a discretization approach and as a feature selection to improve classification accuracy. In this paper, a new hybrid Bat-K-Mean algorithm refer as hBA is proposed to convert continuous data into discrete data called as optimize discrete dataset. Then, Bat is used as feature selection to select the optimum feature from the optimized discrete dataset in order to reduce the dimension of data. The experiment is conducted by using k-Nearest Neighbor to evaluate the effectiveness of discretization and feature selection in classification by comparing with continuous dataset without feature selection, discrete dataset without feature selection, and continuous dataset without discretization and feature selection. Also, to show Bat is potential as a discretization approach and feature selection method. . The experiments were carried out using a number of benchmark datasets from the UCI machine learning repository. The results show the classification accuracy is improved with the Bat-K-Means optimized discretization and Bat optimized feature selection.

Highlights

  • Bat Algorithm (BA) is a metaheuristic optimization algorithm that is found on echolocation characteristics of microbats where the emission and loudness has a varying pulse rates [1]

  • The experiments are set up to show Bat algorithm is potential as discretization approach and as feature selection to improve classification accuracy

  • The Bat algorithm is a powerful algorithm in finding the optimum feature data collection

Read more

Summary

Introduction

Bat Algorithm (BA) is a metaheuristic optimization algorithm that is found on echolocation characteristics of microbats where the emission and loudness has a varying pulse rates [1]. BA could possibly perform better than Particle Swarm Optimization (PSO) [2]. Unlike PSO, the parameter of BA can be varied. In [3] BA was used for classification in medical data. This research [4] objective is to improve the classification performance in large data. This objective is achieved by employing BA as feature selection to select important feature in real-life dataset from UCI repository. WELM-BAT was tested on real world medical diagnosis datasets and the result shows the performance of classification is improved

Objectives
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call