Enhancing Characteristic Gene Selection and Tumor Classification by the Robust Laplacian Supervised Discriminative Sparse PCA.

Lu-Xing Zhang,Jian Xu,He Yan,Dong-Jun Yu,Jiangning Song,Yan Liu

doi:10.1021/acs.jcim.1c01403

Abstract

Characteristic gene selection and tumor classification of gene expression data play major roles in genomic research. Due to the characteristics of a small sample size and high dimensionality of gene expression data, it is a common practice to perform dimensionality reduction prior to the use of machine learning-based methods to analyze the expression data. In this context, classical principal component analysis (PCA) and its improved versions have been widely used. Recently, methods based on supervised discriminative sparse PCA have been developed to improve the performance of data dimensionality reduction. However, such methods still have limitations: most of them have not taken into consideration the improvement of robustness to outliers and noise, label information, sparsity, as well as capturing intrinsic geometrical structures in one objective function. To address this drawback, in this study, we propose a novel PCA-based method, known as the robust Laplacian supervised discriminative sparse PCA, termed RLSDSPCA, which enforces the L2,1 norm on the error function and incorporates the graph Laplacian into supervised discriminative sparse PCA. To evaluate the efficacy of the proposed RLSDSPCA, we applied it to the problems of characteristic gene selection and tumor classification problems using gene expression data. The results demonstrate that the proposed RLSDSPCA method, when used in combination with other related methods, can effectively identify new pathogenic genes associated with diseases. In addition, RLSDSPCA has also achieved the best performance compared with the state-of-the-art methods on tumor classification in terms of major performance metrics. The codes and data sets used in the study are freely available at http://csbio.njust.edu.cn/bioinf/rlsdspca/.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enhancing Characteristic Gene Selection and Tumor Classification by the Robust Laplacian Supervised Discriminative Sparse PCA.

Abstract

Talk to us

Similar Papers

More From: Journal of chemical information and modeling

Lead the way for us

Journal: Journal of chemical information and modeling	Publication Date: Mar 30, 2022
Citations: 3

Similar Papers

Novel gene sets improve set-level classification of prokaryotic gene expression data
Matěj Holec ... Ondřej Kuželka
BMC Bioinformatics | VOL. 16
Matěj Holec, et. al.Matěj Holec ... Ondřej Kuželka
28 Oct 2015
BMC Bioinformatics | VOL. 16

Optimized LSTM with Dimensionality Reduction Based Gene Expression Data Classification
S Jacophine Susmi
Intelligent Automation & Soft Computing | VOL. 33
S Jacophine SusmiS Jacophine Susmi
01 Jan 2021
Intelligent Automation & Soft Computing | VOL. 33

A Review on Feature Selection Techniques for Gene Expression Data
S Vanjimalar ... D Ramyachitra
-
S Vanjimalar, et. al.S Vanjimalar ... D Ramyachitra
01 Dec 2018
01 Dec 2018

A graph-Laplacian PCA based on L<inf>1/2</inf>-norm constraint for characteristic gene selection
Chun-Mei Feng ... Dong-Qin Wang
-
Chun-Mei Feng, et. al.Chun-Mei Feng ... Dong-Qin Wang
01 Dec 2016
01 Dec 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhancing Characteristic Gene Selection and Tumor Classification by the Robust Laplacian Supervised Discriminative Sparse PCA.

Abstract

Talk to us

Similar Papers

More From: Journal of chemical information and modeling