Abstract

In the field of biomedicine, applications of the identification of biomarkers require a robust gene selection mechanism. For identifying the characteristic marker of an observed event, the selection of attributes becomes important. The robustness of gene selection methods affects detecting biologically meaningful genes in tumor diagnosis. For mapping, sequential features Long-short-term memory (LSTM) was used with Artificial Immune Recognition Systems (AIRS) to remember gene sequences and effectively recall of learned sequential patterns. An attempt was made to improve AIRS with LSTM which is a type of RNNs to produce discriminative gene subsets for finding biologically meaningful genes in tumor diagnosis. The algorithms were evaluated using six common cancer microarray datasets. By converging to the intrinsic information of the microarray datasets, specific groups like functions of the co-regulated groups were observed. The results showed that the LSTM based AIRS model can successfully identify biologically significant genes from the microarray datasets. Also the predictive genes for biological sequences areimportant in gene expression microarrays.This study confirms that different genes can be found in the same pathways. It was also found that the gene subsets selected by the algorithms were involved in important biological pathways.In this work we try an LSTM on our learning problem. We suspected that recurrent neural networks would be a good architecture for making predictions.The results show that the optimal gene subsets were based on the suggested framework, so they should have rich biomedical interpretability.

Highlights

  • MICROARRAY TECHNOLOGY is a technology with a high process capacity that can identify thousands of genes at the same time.https://orcid.org/0000-0002-6652-4339Manuscript received August 10, 2019; accepted September 24, 2020

  • Feature selection models were implemented based on gated recurrent unit (GRU), long short-term memory (LSTM), recurrent neural network (RNN) and bidirectional LSTM for microarray datasets

  • We examined the possibility of utilizing Deep Neural Network (DNN) recurrent neural network models to learn disease-related genes, and we used them for the prediction of important biological pathways

Read more

Summary

Introduction

MICROARRAY TECHNOLOGY is a technology with a high process capacity that can identify thousands of genes at the same time.https://orcid.org/0000-0002-6652-4339Manuscript received August 10, 2019; accepted September 24, 2020. In the study conducted by [1], a new framework of feature selection based on recurrent neural network (RNN) was suggested to select a subset of features. The suggested model was applied to select features from microarray data for cell classification. Feature selection models were implemented based on gated recurrent unit (GRU), long short-term memory (LSTM), RNN and bidirectional LSTM for microarray datasets. In the study carried out by [2], a deep neural network model was improved by feature selection algorithms in predicting various biomedical phenotypes. Five binary classification methylome datasets were selected to compute the prediction performances of CNN/DBN/RNN models by utilizing the feature selected by the eleven feature selection algorithms. The results showed that the Deep Belief Network (DBN) model

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call