Abstract

AbstractMany fields function with large databases constitute a high number of features. Feature selection strategies seek to exclude the features that are distracting, repetitive, or unnecessary, as they can degrade the classification results. Existing approaches lack the scalability needed to handle the datasets with millions of instances and they do not obtain favorable results in a timely manner. This study uses a unique feature selection approach based on an upgraded optimization model and deep machine learning‐based data classification. “(a) Feature extraction, (b) optimal feature selection, and (c) classification” are the three stages of the proposed model. Initially, the extracted big‐datasets are efficiently handled by the parallel pool map‐reduce architecture. Several features from the input big‐data are extracted using feature extraction (FE) approaches such as the suggested Tri‐Kernel principal component analysis (TK‐PCA), linear discriminant analysis, and linear square regression. Furthermore, the data obtained characteristics may contain data that is irrelevant, out‐of‐date, or noisy. The computing cost rises due to the larger feature space. As a result, the best features are selected using a new optimization technique known as Levy Adapted SLnO (LA‐SLnO), which is a superior variant of the original SLnO algorithm. This selection of appropriate features improves the classification accuracy. For classification, Convolutional Neural Network is used in this work. Finally, a comparative evaluation is undergone to validate the efficiency of the proposed model.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call