Investigating the Efficacy of Nonlinear Dimensionality Reduction Schemes in Classifying Gene and Protein Expression Studies

G Lee,C Rodriguez,A Madabhushi

doi:10.1109/tcbb.2008.36

Abstract

The recent explosion in procurement and availability of high-dimensional gene- and protein-expression profile datasets for cancer diagnostics has necessitated the development of sophisticated machine learning tools with which to analyze them. A major limitation in the ability to accurate classify these high-dimensional datasets stems from the 'curse of dimensionality', occurring in situations where the number of genes or peptides significantly exceeds the total number of patient samples. Previous attempts at dealing with this issue have mostly centered on the use of a dimensionality reduction (DR) scheme, Principal Component Analysis (PCA), to obtain a low-dimensional projection of the high-dimensional data. However, linear PCA and other linear DR methods, which rely on Euclidean distances to estimate object similarity, do not account for the inherent underlying nonlinear structure associated with most biomedical data. The motivation behind this work is to identify the appropriate DR methods for analysis of high-dimensional gene- and protein-expression studies. Towards this end, we empirically and rigorously compare three nonlinear (Isomap, Locally Linear Embedding, Laplacian Eigenmaps) and three linear DR schemes (PCA, Linear Discriminant Analysis, Multidimensional Scaling) with the intent of determining a reduced subspace representation in which the individual object classes are more easily discriminable.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Investigating the Efficacy of Nonlinear Dimensionality Reduction Schemes in Classifying Gene and Protein Expression Studies

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics

Lead the way for us

Journal: IEEE/ACM Transactions on Computational Biology and Bioinformatics	Publication Date: Jul 1, 2008
Citations: 159

Similar Papers

A critical study of different dimensionality reduction methods for gear crack degradation assessment under different operating conditions
Xiang Wan ... Qing Zhang
Measurement | VOL. 78
Xiang Wan, et. al.Xiang Wan ... Qing Zhang
22 Oct 2015
Measurement | VOL. 78

Comparative study of different dimensionality reduction methods in hyperspectral image classification
Lei Kang ... Xiaoqing Hu
Journal of Physics: Conference Series | VOL. 2024
Lei Kang, et. al.Lei Kang ... Xiaoqing Hu
01 Sep 2021
Journal of Physics: Conference Series | VOL. 2024

An Empirical Comparison of Dimensionality Reduction Methods for Classifying Gene and Protein Expression Datasets
George Lee ... Anant Madabhushi
-
George Lee, et. al.George Lee ... Anant Madabhushi
07 May 2007
07 May 2007

Unsupervised Dimensionality Reduction for High-Dimensional Data Classification
...
-
, et. al. ...
31 Aug 2017
31 Aug 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Investigating the Efficacy of Nonlinear Dimensionality Reduction Schemes in Classifying Gene and Protein Expression Studies

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics