Multi-class cancer classification by total principal component regression (TPCR) using microarray gene expression data

Y Tan

doi:10.1093/nar/gki144

Abstract

DNA microarray technology provides a promising approach to the diagnosis and prognosis of tumors on a genome-wide scale by monitoring the expression levels of thousands of genes simultaneously. One problem arising from the use of microarray data is the difficulty to analyze the high-dimensional gene expression data, typically with thousands of variables (genes) and much fewer observations (samples), in which severe collinearity is often observed. This makes it difficult to apply directly the classical statistical methods to investigate microarray data. In this paper, total principal component regression (TPCR) was proposed to classify human tumors by extracting the latent variable structure underlying microarray data from the augmented subspace of both independent variables and dependent variables. One of the salient features of our method is that it takes into account not only the latent variable structure but also the errors in the microarray gene expression profiles (independent variables). The prediction performance of TPCR was evaluated by both leave-one-out and leave-half-out cross-validation using four well-known microarray datasets. The stabilities and reliabilities of the classification models were further assessed by re-randomization and permutation studies. A fast kernel algorithm was applied to decrease the computation time dramatically. (MATLAB source code is available upon request.)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nucleic Acids Research	Publication Date: Jan 7, 2005
Citations: 91	License type: cc-by-nc

R Discovery Prime

R Discovery Prime

Multi-class cancer classification by total principal component regression (TPCR) using microarray gene expression data

Abstract

Talk to us

Similar Papers

More From: Nucleic Acids Research

Lead the way for us

Similar Papers

Multi-class tumor classification by discriminant partial least squares using microarray gene expression data and assessment of classification models
Yongxi Tan ... Charles Wang
Computational Biology and Chemistry | VOL. 28
Yongxi Tan, et. al.Yongxi Tan ... Charles Wang
01 Jul 2004
Computational Biology and Chemistry | VOL. 28

Accurate detection of aneuploidies in array CGH and gene expression microarray data.
Chad L Myers ... Maitreya J Dunham
Bioinformatics | VOL. 20
Chad L Myers, et. al.Chad L Myers ... Maitreya J Dunham
29 Jul 2004
Bioinformatics | VOL. 20

Highly interconnected genes in disease-specific networks are enriched for disease-associated polymorphisms
Fredrik Barrenäs ... Hui Wang
Genome Biology | VOL. 13
Fredrik Barrenäs, et. al.Fredrik Barrenäs ... Hui Wang
01 Jan 2012
Genome Biology | VOL. 13

A Survey on Deep Learning Techniques for Prognosis and Diagnosis of Cancer from Microarray Gene Expression Data
Rahul Shahane ... C S R Prabhu
Journal of Computational and Theoretical Nanoscience | VOL. 16
Rahul Shahane, et. al.Rahul Shahane ... C S R Prabhu
01 Dec 2019
Journal of Computational and Theoretical Nanoscience | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-class cancer classification by total principal component regression (TPCR) using microarray gene expression data

Abstract

Talk to us

Similar Papers

More From: Nucleic Acids Research