Gene selection and cancer classification using interaction-based feature clustering and improved-binary Bat algorithm

Ahmad Esfandiari, Niki Nasiri

doi:10.1016/j.compbiomed.2024.109071

Abstract

In high-dimensional gene expression data, selecting an optimal subset of genes is crucial for achieving high classification accuracy and reliable diagnosis of diseases. This paper proposes a two-stage hybrid model for gene selection based on clustering and a swarm intelligence algorithm to identify the most informative genes with high accuracy. First, a clustering-based multivariate filter approach is performed to explore the interactions between the features and eliminate any redundant or irrelevant ones. Then, by controlling for the problem of premature convergence in the binary Bat algorithm, the optimal gene subset is determined using different classifiers with the Monte Carlo cross-validation data partitioning model. The effectiveness of our proposed framework is evaluated using eight gene expression datasets, by comparison with other recently published algorithms in the literature. Experiments confirm that in seven out of eight datasets, the proposed method can achieve superior results in terms of classification accuracy and gene subset size. In particular, it achieves a classification accuracy of 100% in Lymphoma and Ovarian datasets and above 97.4% in the rest with a minimum number of genes. The results demonstrate that our proposed algorithm has the potential to solve the feature selection problem in different applications with high-dimensional datasets.
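The Monte Carlo cross-validation step mentioned in the abstract can be illustrated with a minimal sketch. It scores a candidate gene subset, encoded as a binary mask, by averaging test accuracy over repeated random train/test splits. The abstract does not give the split ratio, the number of repeats, or the classifiers used, so the 1-nearest-neighbor classifier, the function names, the parameter defaults, and the toy data below are all assumptions made for illustration, not the authors' implementation.

```python
import random

def nearest_neighbor_predict(train_X, train_y, x):
    """Return the label of the training sample closest to x (squared Euclidean distance)."""
    best = min(
        range(len(train_X)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], x)),
    )
    return train_y[best]

def monte_carlo_cv_accuracy(X, y, mask, n_repeats=20, test_fraction=0.3, seed=0):
    """Mean test accuracy over repeated random splits, using only genes where mask is 1.

    n_repeats and test_fraction are assumed values, not taken from the paper.
    """
    selected = [j for j, bit in enumerate(mask) if bit]
    if not selected:
        return 0.0  # an empty gene subset cannot classify anything
    reduced = [[row[j] for j in selected] for row in X]
    rng = random.Random(seed)
    n = len(reduced)
    n_test = max(1, int(round(test_fraction * n)))
    scores = []
    for _ in range(n_repeats):
        idx = list(range(n))
        rng.shuffle(idx)  # a fresh random partition on every repeat
        test_idx, train_idx = idx[:n_test], idx[n_test:]
        train_X = [reduced[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        correct = sum(
            nearest_neighbor_predict(train_X, train_y, reduced[i]) == y[i]
            for i in test_idx
        )
        scores.append(correct / n_test)
    return sum(scores) / len(scores)

# Toy data (hypothetical): gene 0 separates the two classes, gene 1 is noise.
X = [[0.0, 5.0], [0.1, -3.0], [0.2, 8.0], [0.3, -7.0], [0.4, 1.0],
     [10.0, 4.0], [10.1, -6.0], [10.2, 9.0], [10.3, -2.0], [10.4, 0.0]]
y = [0] * 5 + [1] * 5

informative_only = monte_carlo_cv_accuracy(X, y, [1, 0])
noise_only = monte_carlo_cv_accuracy(X, y, [0, 1])
```

In a wrapper-style search such as a binary Bat algorithm, a score like this would typically serve as part of the fitness of each candidate mask, often combined with a penalty on the number of selected genes. How the paper weights accuracy against subset size is not stated in the abstract.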

More From: Computers in Biology and Medicine

Similar Papers

NNFSRR: Nearest Neighbor Feature Selection and Redundancy Removal Method for Nearest Neighbor Search in Microarray Gene Expression Data
Rupali Bhartiya ... Gend Lal Prajapati
EAI Endorsed Transactions on Pervasive Health and Technology | VOL. 9
19 Sep 2023

Rough-FS
Rashmi Rekha Sahoo ... Smita Prava Mishra
03 Sep 2012

Gene expression feature selection for prostate cancer diagnosis using a two-phase heuristic-deterministic search strategy.
Saleh Shahbeig ... Mohammad Sadegh Helfroush
IET systems biology | VOL. 12
01 Aug 2018

A wrapper based binary bat algorithm with greedy crossover for attribute selection
S Akila ... S Allin Christe
Expert Systems with Applications | VOL. 187
03 Sep 2021
