A centroid-based gene selection method for microarray data classification

Shun Guo,Donghui Guo,Lifei Chen,Qingshan Jiang

doi:10.1016/j.jtbi.2016.03.034

Abstract

For classification problems based on microarray data, the data typically contains a large number of irrelevant and redundant features. In this paper, a new gene selection method is proposed to choose the best subset of features for microarray data with the irrelevant and redundant features removed. We formulate the selection problem as a L1-regularized optimization problem, based on a newly defined linear discriminant analysis criterion. Instead of calculating the mean of the samples, a kernel-based approach is used to estimate the class centroid to define both the between-class separability and the within-class compactness for the criterion. Theoretical analysis indicates that the global optimal solution of the L1-regularized criterion can be reached with a general condition, on which an efficient algorithm is derived to the feature selection problem in a linear time complexity with respect to the number of features and the number of samples. The experimental results on ten publicly available microarray datasets demonstrate that the proposed method performs effectively and competitively compared with state-of-the-art methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A centroid-based gene selection method for microarray data classification

Abstract

Talk to us

Similar Papers

More From: Journal of Theoretical Biology

Lead the way for us

Journal: Journal of Theoretical Biology	Publication Date: Apr 4, 2016
Citations: 28

Similar Papers

A binary Krill Herd approach based feature selection for high dimensional data
V Preeja ... A H Shahana
-
V Preeja, et. al.V Preeja ... A H Shahana
01 Aug 2016
01 Aug 2016

SVM for network anomaly detection using ACO feature subset
Tahir Mehmood ... Helmi B Md Rais
-
Tahir Mehmood, et. al.Tahir Mehmood ... Helmi B Md Rais
01 May 2015
01 May 2015

Determination of biomarkers from microarray data using graph neural network and spectral clustering
Kun Yu ... Linjie Wang
Scientific Reports | VOL. 11
Kun Yu, et. al.Kun Yu ... Linjie Wang
01 Dec 2021
Scientific Reports | VOL. 11

Heart Disease Prediction Model Using Naïve Bayes Algorithm and Machine Learning Techniques
Maria Yousef ... Prof Khaled Batiha
International Journal of Engineering & Technology | VOL. 10
Maria Yousef, et. al.Maria Yousef ... Prof Khaled Batiha
02 Feb 2021
International Journal of Engineering & Technology | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A centroid-based gene selection method for microarray data classification

Abstract

Talk to us

Similar Papers

More From: Journal of Theoretical Biology