Cancer Classification from Microarray Data using Gene Feature Ranking

Abid Hasan, Golam Morshed Maruf, Shareef Shareef, Hawlader Abdullah Al Mamun, Paul Kawn

doi:10.5958/j.2249-3212.1.2.2

Abstract

A significant challenge in DNA (deoxyribonucleic acid) microarray analysis is that datasets contain a large number of features (genes) but only a small number of samples. When applying statistical methods to analyse microarray data, particular care is required to deal with problems such as the low classification accuracy of models brought about by the small number of features that have predictive capability. To overcome these problems, proper approaches for data normalisation, feature reduction, and identification of the optimal set of genes are critical. In this paper, we apply the Gene Feature Ranking (GFR) [5] method to select genes with high trust values from high-dimensional cancer microarray datasets. Our contribution lies in the use of a different metric for calculating the trust values, one that is more domain-specific for cancer datasets. By choosing a pre-defined threshold based on the user's knowledge, only genes that show sufficient trustworthiness are retained for constructing the classification model. Through experimentation on three microarray datasets, namely Acute Lymphoblastic Leukemia (ALL), lymph node negative primary breast cancer, and High Grade Glioma, we confirm that the classification accuracy obtained with the genes selected by the modified GFR method is consistently higher than the accuracy obtained without it.
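The threshold-based selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not define the trust metric, so the score below (a per-gene signal-to-noise ratio between two classes) is only a stand-in assumption, and the names `trust_scores`, `select_genes`, and the toy data are hypothetical, not taken from the paper or from the GFR method.

```python
from statistics import mean, pstdev


def trust_scores(samples, labels):
    """Return one illustrative score per gene (column).

    ASSUMPTION: the score is |mean_0 - mean_1| / (sd_0 + sd_1), a simple
    signal-to-noise ratio. It is a stand-in, not the paper's trust metric.
    """
    n_genes = len(samples[0])
    scores = []
    for g in range(n_genes):
        # Split the expression values of gene g by class label (0 or 1).
        a = [row[g] for row, y in zip(samples, labels) if y == 0]
        b = [row[g] for row, y in zip(samples, labels) if y == 1]
        spread = pstdev(a) + pstdev(b)
        # A gene with zero spread is scored 0.0 to keep the sketch simple.
        scores.append(abs(mean(a) - mean(b)) / spread if spread > 0 else 0.0)
    return scores


def select_genes(scores, threshold):
    """Keep the indices of genes whose score meets the user-chosen threshold."""
    return [g for g, s in enumerate(scores) if s >= threshold]


# Hypothetical toy data: 4 samples (rows) x 3 genes (columns), two classes.
samples = [
    [5.1, 2.0, 7.9],
    [4.9, 2.2, 1.1],
    [1.0, 2.1, 8.2],
    [1.2, 1.9, 0.9],
]
labels = [0, 0, 1, 1]

scores = trust_scores(samples, labels)
selected = select_genes(scores, threshold=2.0)
print(selected)
```

In this toy example only the first gene separates the two classes clearly, so it is the only one retained; a classifier would then be trained on the retained columns alone. The threshold value is arbitrary here, whereas the abstract states that it is chosen from the user's knowledge.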

More From: Pearl : A Journal of Library and Information Science
