Hybrid Symmetrical Uncertainty and Reference Set Harmony Search Algorithm for Gene Selection Problem

Salam Salameh Shreem, Nor Samsiah Sani, Mohd Zakree Ahmad Nazri, Salwani Abdullah

doi:10.3390/math10030374

Open Access

https://doi.org/10.3390/math10030374

Journal: Mathematics
Publication Date: Jan 26, 2022
Citations: 8
License type: CC BY 4.0

Affiliation: National University of Malaysia

Abstract

Selecting the smallest possible set of genes from microarray datasets for clinical diagnosis and prediction is one of the most challenging machine learning tasks. A robust gene selection technique is required to identify the most significant subset of genes by removing spurious or non-predictive genes from the original dataset without reducing classification accuracy. Numerous studies have attempted to address this issue by implementing either a filter or a wrapper. Although filter approaches are computationally efficient, they are completely independent of the induction algorithm. Wrapper approaches, on the other hand, outperform filter approaches but are computationally more expensive. Therefore, this study proposes an enhanced gene selection method, known as SU-RSHSA, that combines the Symmetrical Uncertainty (SU) filter with the Reference Set Harmony Search Algorithm (RSHSA) wrapper. The development of SU-RSHSA comprises three stages: (1) investigating a novel gene selection method based on the harmony search algorithm (HSA) and determining appropriate values for the HSA’s parameters; (2) enhancing the construction of the initial harmony memory, while preserving solution diversity, by embedding a reference set within the HSA (RSHSA); and (3) investigating the effect of integrating SU as a filter and RSHSA as a wrapper (SU-RSHSA) to maximize classification accuracy by leveraging their respective advantages. The results demonstrate that SU-RSHSA outperforms the original HSA and SU-HSA in classification accuracy, in the number of selected relevant genes, and in computational time. More importantly, the proposed SU-RSHSA gene selection method effectively generates a small subset of salient genes with high classification accuracy.
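The filter stage named in the abstract can be illustrated with a minimal sketch of Symmetrical Uncertainty, using its standard definition SU(X, Y) = 2 I(X; Y) / (H(X) + H(Y)). The sketch assumes expression values have already been discretized (for example into low/high bins); the helper names `entropy`, `symmetrical_uncertainty`, and `su_filter` are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def symmetrical_uncertainty(x, y):
    """SU(X, Y) = 2 * I(X; Y) / (H(X) + H(Y)), a value in [0, 1]."""
    hx, hy = entropy(x), entropy(y)
    if hx + hy == 0:
        return 0.0  # both variables are constant, so there is nothing to share
    hxy = entropy(list(zip(x, y)))  # joint entropy H(X, Y)
    mutual_information = hx + hy - hxy
    return 2.0 * mutual_information / (hx + hy)


def su_filter(genes, labels, k):
    """Rank genes by SU against the class labels and keep the top k.

    `genes` maps a gene name to its discretized expression values
    (an assumed input format, chosen only for this illustration).
    """
    scores = {name: symmetrical_uncertainty(vals, labels)
              for name, vals in genes.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Toy data: g1 tracks the labels exactly, g2 is unrelated to them.
labels = [0, 0, 1, 1]
genes = {"g1": [0, 0, 1, 1], "g2": [0, 1, 0, 1], "g3": [0, 0, 0, 1]}
top_genes = su_filter(genes, labels, k=2)
```

In a hybrid scheme of the kind the abstract describes, a ranking like this would shrink the candidate gene pool before the more expensive wrapper search runs; the cutoff `k` here is an arbitrary illustrative choice.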

Highlights

DNA microarrays and RNA sequencing (RNA-seq) are the two major technologies for high-throughput analysis of transcript abundance.
The Reference Set Harmony Search Algorithm (RSHSA) ran faster than the harmony search algorithm (HSA) on all datasets. This may be because the Reference Set Harmony Memory (RSHM) holds fewer solutions than the original harmony memory (HM).
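To make the role of the harmony memory concrete, the following is a minimal sketch of a generic binary harmony search over gene masks. It is not the paper's RSHSA: the reference set construction is not described on this page and is not reproduced here. The parameter names (`hms`, `hmcr`, `par`), their default values, and the `toy_fitness` function are assumptions for illustration; in a wrapper setting the fitness would typically come from a classifier's accuracy.

```python
import random


def harmony_search(fitness, n_genes, hms=10, hmcr=0.9, par=0.3,
                   iterations=200, seed=0):
    """Generic binary harmony search; returns (best_mask, best_score).

    hms  : harmony memory size (number of stored candidate solutions)
    hmcr : probability of taking a bit from memory rather than at random
    par  : probability of flipping a bit that was taken from memory
    """
    rng = random.Random(seed)
    # Initial harmony memory: random 0/1 masks, one entry per gene.
    memory = [[rng.randint(0, 1) for _ in range(n_genes)] for _ in range(hms)]
    scores = [fitness(h) for h in memory]

    for _ in range(iterations):
        new = []
        for j in range(n_genes):
            if rng.random() < hmcr:
                bit = memory[rng.randrange(hms)][j]  # memory consideration
                if rng.random() < par:
                    bit = 1 - bit                    # pitch adjustment (flip)
            else:
                bit = rng.randint(0, 1)              # random selection
            new.append(bit)

        new_score = fitness(new)
        worst = min(range(hms), key=scores.__getitem__)
        if new_score > scores[worst]:
            # The improvised harmony replaces the worst stored one.
            memory[worst], scores[worst] = new, new_score

    best = max(range(hms), key=scores.__getitem__)
    return memory[best], scores[best]


# Hypothetical fitness: reward two "informative" genes, penalize subset size.
INFORMATIVE = {0, 3}


def toy_fitness(mask):
    hits = sum(mask[i] for i in INFORMATIVE)
    return hits - 0.1 * sum(mask)


best_mask, best_score = harmony_search(toy_fitness, n_genes=8)
```

Every iteration scans the memory to find its worst entry, and the memory must first be filled and scored, so a smaller memory means less work per run. That is consistent with the highlight's suggestion that the smaller RSHM may explain the speedup, though this sketch does not measure it.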

Summary

Introduction

DNA microarrays and RNA sequencing (RNA-seq) are the two major technologies for high-throughput analysis of transcript abundance. Advances in these technologies have enabled scientists to accumulate massive amounts of gene expression microarray data. Selecting a subset of genes that is optimal for classification is an arduous but crucial task, because the number of genes highly correlated with a specific phenotype is very small compared to the thousands of genes in a sample. To facilitate this task, feature selection methods have been proposed that reduce the dimensionality of the data by choosing the most salient genes and eliminating redundant and irrelevant ones while retaining high classification accuracy.

Similar Papers

Wrapper-based gene selection with Markov blanket
Aiguo Wang ... Gil Alterovitz
Computers in Biology and Medicine | VOL. 81
05 Dec 2016

Gene selection for classification of microarray data based on the Bayes error.
Ji-Gang Zhang ... Hong-Wen Deng
BMC Bioinformatics | VOL. 8
03 Oct 2007

A TRIZ-inspired bat algorithm for gene selection in cancer classification
Mohammed Azmi Al-Betar ... Saeid M Abu-Romman
Genomics | VOL. 112
30 Oct 2019

Microarray Gene Expression Data for Detection Alzheimer’s Disease Using k-means and Deep Learning
Heba M Al-Bermany ... Sura Z Al-Rashid
24 Feb 2021