Performance analysis of feature selection and classification in Big Data Information extraction

Manjunatha Swamy, C,Dr Lokesh, M R,Dr S Meenakshi Sundaram

doi:10.36348/sjet.2023.v08i03.002

Abstract

Purpose: Information extraction from big data is improved by either reducing the number of features in a data set or selecting features using intelligent data analysis. Generally, big data sets are complex to process using traditional approaches. Feature selection is highly essential in big data information extraction because it chooses the subset of features that influence the final classification. Reducing the number of selected features in the data leads to enhanced accuracy and efficiency of data extraction with other attributes used in the mathematical model. This work aims to improve the performance of the classifier using an enhanced binary bat algorithm-based effective feature selection model. formulated to enhance accuracy, efficiency of data extraction with other attributes. An enhanced binary bat algorithm (EBBA) proposed to solve the mentioned problem using local optimization and global optimization factor which improves the performance of optimization. Experiment carried out with different datasets selected to test effective performance of proposed algorithm and demonstrated performance is better with other algorithms. Design: The purpose of this paper is to provide, an effective feature selection model for big data information extraction. An enhanced binary bat algorithm has been proposed to improve attribute selection using local optimization and global optimization methods. Classification of multisource data using selected features using labeled approach. Particular Information extraction for multi view multi label (PIMM) approach is compared with EBBA algorithm. Further to enhance effectiveness of shared and specific information in big data [3] by setting the delta and omega factors in order to fuse different information from different view point, Online analysis of relevance with any redundancy analysis also been incorporated. Findings: All the experiments were carried out with different datasets on the number of iterations and fitness of the attributes to validate the effective performance of the proposed algorithm. Experimental results and graphs show that the proposed methodology improves the overall performance of optimization using PIMM models. Originality: A feature selection model based on the binary bat algorithm has been the focus of this paper. Subset selection and feature ranking are the two important methods used in this approach. Experiments were conducted on datasets to analyze the patterns in the number of iterations and fitness of the attributes over selection. The improvement in feature selection leads to better classification accuracy of the proposed model compared to other nature inspired techniques.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Performance analysis of feature selection and classification in Big Data Information extraction

Abstract

Talk to us

Similar Papers

More From: Saudi Journal of Engineering and Technology

Lead the way for us

Similar Papers

EBBA: An Enhanced Binary Bat Algorithm Integrated with Chaos Theory and Lévy Flight for Feature Selection
Jinghui Feng ... Haopeng Kuang
Future Internet | VOL. 14
Jinghui Feng, et. al.Jinghui Feng ... Haopeng Kuang
09 Jun 2022
Future Internet | VOL. 14

An Efficient Binary Clonal Selection Algorithm with Optimum Path Forest for Feature Selection
Emad Nabil ... Safinaz Abdel-Fattah
International Journal of Advanced Computer Science and Applications | VOL. 11
Emad Nabil, et. al.Emad Nabil ... Safinaz Abdel-Fattah
01 Jan 2020
International Journal of Advanced Computer Science and Applications | VOL. 11

A Binary Bat Approach for Identification of Fatigue Condition from sEMG Signals
Navaneethakrishna Makaram ... Ramakrishnan Swaminathan
-
Navaneethakrishna Makaram, et. al.Navaneethakrishna Makaram ... Ramakrishnan Swaminathan
01 Jan 2015
01 Jan 2015

Credit card fraud detection using the brown bear optimization algorithm
Shaymaa E Sorour ... Amr A Abd El-Mageed
Alexandria Engineering Journal | VOL. 104
Shaymaa E Sorour, et. al.Shaymaa E Sorour ... Amr A Abd El-Mageed
24 Jun 2024
Alexandria Engineering Journal | VOL. 104

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance analysis of feature selection and classification in Big Data Information extraction

Abstract

Talk to us

Similar Papers

More From: Saudi Journal of Engineering and Technology