A weighted-sum chaotic sparrow search algorithm for interdisciplinary feature selection and data classification

Liyun Jia,Tao Wang,Ahmed G Gad,Ahmed Salem

doi:10.1038/s41598-023-38252-0

Abstract

In today’s data-driven digital culture, there is a critical demand for optimized solutions that essentially reduce operating expenses while attempting to increase productivity. The amount of memory and processing time that can be used to process enormous volumes of data are subject to a number of limitations. This would undoubtedly be more of a problem if a dataset contained redundant and uninteresting information. For instance, many datasets contain a number of non-informative features that primarily deceive a given classification algorithm. In order to tackle this, researchers have been developing a variety of feature selection (FS) techniques that aim to eliminate unnecessary information from the raw datasets before putting them in front of a machine learning (ML) algorithm. Meta-heuristic optimization algorithms are often a solid choice to solve NP-hard problems like FS. In this study, we present a wrapper FS technique based on the sparrow search algorithm (SSA), a type of meta-heuristic. SSA is a swarm intelligence (SI) method that stands out because of its quick convergence and improved stability. SSA does have some drawbacks, like lower swarm diversity and weak exploration ability in late iterations, like the majority of SI algorithms. So, using ten chaotic maps, we try to ameliorate SSA in three ways: (i) the initial swarm generation; (ii) the substitution of two random variables in SSA; and (iii) clamping the sparrows crossing the search range. As a result, we get CSSA, a chaotic form of SSA. Extensive comparisons show CSSA to be superior in terms of swarm diversity and convergence speed in solving various representative functions from the Institute of Electrical and Electronics Engineers (IEEE) Congress on Evolutionary Computation (CEC) benchmark set. Furthermore, experimental analysis of CSSA on eighteen interdisciplinary, multi-scale ML datasets from the University of California Irvine (UCI) data repository, as well as three high-dimensional microarray datasets, demonstrates that CSSA outperforms twelve state-of-the-art algorithms in a classification task based on FS discipline. Finally, a 5%-significance-level statistical post-hoc analysis based on Wilcoxon’s signed-rank test, Friedman’s rank test, and Nemenyi’s test confirms CSSA’s significance in terms of overall fitness, classification accuracy, selected feature size, computational time, convergence trace, and stability.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Aug 28, 2023
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A weighted-sum chaotic sparrow search algorithm for interdisciplinary feature selection and data classification

Abstract

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

Guest Editorial: Learning, optimisation and control of cyber‐physical systems
Jian Sun ... Guo‐Ping Liu
IET Cyber-Physical Systems: Theory & Applications | VOL. 7
Jian Sun, et. al.Jian Sun ... Guo‐Ping Liu
01 Dec 2022
IET Cyber-Physical Systems: Theory & Applications | VOL. 7

Global competition and technological transition in electrical, electronic, information and communication engineering: quantitative analysis of periodicals and conference proceedings of the IEEE
Nobuyuki Shirakawa ... Kumi Okuwada
Scientometrics | VOL. 91
Nobuyuki Shirakawa, et. al.Nobuyuki Shirakawa ... Kumi Okuwada
06 Dec 2011
Scientometrics | VOL. 91

A novel chaotic transient search optimization algorithm for global optimization, real-world engineering problems and feature selection.
Osman Altay ... Elif Varol Altay
PeerJ. Computer science | VOL. 9
Osman Altay, et. al.Osman Altay ... Elif Varol Altay
22 Aug 2023
PeerJ. Computer science | VOL. 9

Sensor Network based on IEEE 1451.0 and IEEE p1451.2-RS232
Eugene Y Song ... Kang B Lee
-
Eugene Y Song, et. al.Eugene Y Song ... Kang B Lee
01 May 2008
Sensor Network based on IEEE 1451.0 and IEEE p1451.2-RS232
Eugene Y Song ... Kang B Lee

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A weighted-sum chaotic sparrow search algorithm for interdisciplinary feature selection and data classification

Abstract

Talk to us

Similar Papers

More From: Scientific Reports