LS Bound based gene selection for DNA microarray data

X Zhou,K Z Mao

doi:10.1093/bioinformatics/bti216

Abstract

One problem with discriminant analysis of DNA microarray data is that each sample is represented by quite a large number of genes, and many of them are irrelevant, insignificant or redundant to the discriminant problem at hand. Methods for selecting important genes are, therefore, of much significance in microarray data analysis. In the present study, a new criterion, called LS Bound measure, is proposed to address the gene selection problem. The LS Bound measure is derived from leave-one-out procedure of LS-SVMs (least squares support vector machines), and as the upper bound for leave-one-out classification results it reflects to some extent the generalization performance of gene subsets. We applied this LS Bound measure for gene selection on two benchmark microarray datasets: colon cancer and leukemia. We also compared the LS Bound measure with other evaluation criteria, including the well-known Fisher's ratio and Mahalanobis class separability measure, and other published gene selection algorithms, including Weighting factor and SVM Recursive Feature Elimination. The strength of the LS Bound measure is that it provides gene subsets leading to more accurate classification results than the filter method while its computational complexity is at the level of the filter method. A companion website can be accessed at http://www.ntu.edu.sg/home5/pg02776030/lsbound/. The website contains: (1) the source code of the gene selection algorithm; (2) the complete set of tables and figures regarding the experimental study; (3) proof of the inequality (9). ekzmao@ntu.edu.sg.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

LS Bound based gene selection for DNA microarray data

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Journal: Bioinformatics	Publication Date: Dec 14, 2004
Citations: 106

Similar Papers

Gene selection algorithms for microarray data based on least squares support vector machine.
E Ke Tang ... Pn Suganthan
BMC Bioinformatics | VOL. 7
E Ke Tang, et. al.E Ke Tang ... Pn Suganthan
27 Feb 2006
BMC Bioinformatics | VOL. 7

Gene Selection of DNA Microarray Data Based on Regularization Networks
Xin Zhou ... Kezhi Mao
-
Xin Zhou, et. al.Xin Zhou ... Kezhi Mao
01 Jan 2004
01 Jan 2004

Selecting marker genes for cancer classification using supervised weighted kernel clustering and the support vector machine
Jooyong Shim ... Changha Hwang
Computational Statistics and Data Analysis | VOL. 53
Jooyong Shim, et. al.Jooyong Shim ... Changha Hwang
01 May 2008
Computational Statistics and Data Analysis | VOL. 53

MSVM-RFE: extensions of SVM-RFE for multiclass gene selection on DNA microarray data
Xin Zhou ... David P Tuck
Bioinformatics | VOL. 23
Xin Zhou, et. al.Xin Zhou ... David P Tuck
01 May 2007
Bioinformatics | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

LS Bound based gene selection for DNA microarray data

Abstract

Talk to us

Similar Papers

More From: Bioinformatics