Abstract

Cancer is the uncontrolled growth of abnormal cells in the body and is a major death cause now a days. Cancer may arise anywhere in the human body, and it names are remarked as body parts such as colon cancer, lung cancer, breast cancer. It is notable that cancer treatment is much easier in the initial stage rather than it outbreaks. DNA microarray based gene expression profiling has become efficient technique for cancer identification in early stage and a number of studies are available in this regard. Existing methods used different feature selection methods (e.g., wrapper and filter approaches) to select relevant genes and then employed distinct classifiers (e.g., artificial neural network, Naive Bayes, Decision Tree, Support Vector Machine) to identify cancer. This study considered information theoretic based minimum Redundancy Maximum Relevance (mRMR)method to select important genes and then employed artificial neural network (ANN) for cancer classification. Proposed mRMR-ANN method has been tested on a suite of benchmark data sets of various cancer. Experimental results revealed the proposed method as an effective method for cancer classification when performance compared with several related exiting methods.

Highlights

  • Cancer is the uncontrolled growth of abnormal cells in the body and is a major death cause nowadays

  • ALL/AML dataset contains 72 samples of 7,129 genes; in which 47 samples are for acute lymphoblastic leukemia (ALL) and 25 samples are for acute myeloid leukemia (AML)

  • Cancer treatment is much easier in the initial stage and DNA microarray based gene expression profiling has become an efficient technique for it

Read more

Summary

INTRODUCTION

Cancer is the uncontrolled growth of abnormal cells in the body and is a major death cause nowadays. A number of techniques have been investigated in past several years for cancer classification from DNA microarray gene expression data. Takahashi et al [4] investigated a hybrid method of projective Adaptive Resonance Theory based ANN and boosted fuzzy classifier with SWEEP operator for cancer classification They combined wrapper and filter approaches for gene selection. SVM was used to classify cancer from features of the selected genes They tested the algorithm on several gene expression microarray datasets including colon, leukemia, ALL/AML cancer. Www.ijacsa.thesai.org [8] investigated gene expressions based colon classification (GECC) using different feature selection methods including mRMR and ensemble of SVMs. A modified version of SVM, called Transductive SVM, is investigated for cancer classification by Maulik et al [9].

Preprocessing of Microarray Gene Expression Data
EXPERIMENTAL STUDIES
Benchmark Datasets
Experimental Setup
Experimental Results and Performance Comparison
Findings
CONCLUSIONS
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call