Nonnegative matrix factorization: an analytical and interpretive tool in computational biology.

Karthik Devarajan

doi:10.1371/journal.pcbi.1000029

Karthik Devarajan

Open Access

https://doi.org/10.1371/journal.pcbi.1000029

Copy DOI

Journal: PLoS Computational Biology	Publication Date: Jul 25, 2008
Citations: 459	License type: CC BY 4.0

Affiliation: Fox Chase Cancer Center

Abstract

In the last decade, advances in high-throughput technologies such as DNA microarrays have made it possible to simultaneously measure the expression levels of tens of thousands of genes and proteins. This has resulted in large amounts of biological data requiring analysis and interpretation. Nonnegative matrix factorization (NMF) was introduced as an unsupervised, parts-based learning paradigm involving the decomposition of a nonnegative matrix V into two nonnegative matrices, W and H, via a multiplicative updates algorithm. In the context of a p×n gene expression matrix V consisting of observations on p genes from n samples, each column of W defines a metagene, and each column of H represents the metagene expression pattern of the corresponding sample. NMF has been primarily applied in an unsupervised setting in image and natural language processing. More recently, it has been successfully utilized in a variety of applications in computational biology. Examples include molecular pattern discovery, class comparison and prediction, cross-platform and cross-species analysis, functional characterization of genes and biomedical informatics. In this paper, we review this method as a data analytical and interpretive tool in computational biology with an emphasis on these applications.

Highlights

The rapid development in high-throughput technologies in the past decade has given rise to large-scale biological data in the form of expression profiles of tens of thousands of genes and proteins, often with only a handful of tissue samples
The objective is to identify differentially expressed genes between the different classes of interest; in class prediction, the emphasis is on building a predictive gene set based on the class labels and expression profiles of known samples, and to apply it to a new sample to predict its class
We review nonnegative matrix factorization (NMF) and its applications in computational biology, with an emphasis on the analysis and interpretation of high-throughput biological data such as those above

Summary

Introduction

The rapid development in high-throughput technologies in the past decade has given rise to large-scale biological data in the form of expression profiles of tens of thousands of genes and proteins, often with only a handful of tissue samples. Dimensionality reduction and visualization are key aspects in effectively analyzing and interpreting the high-dimensional data in this setting. Such unsupervised approaches are useful and relevant when there is no a priori knowledge of the expected gene expression patterns for a given set of genes or for any phenotype (such as experimental condition, tissue type, or patient). In studies where such prior knowledge is available, the focus is on class comparison or class prediction. We examine the usefulness of its stochastic nature in selecting an appropriate model for a given dataset and for faster implementation of the algorithm

Objectives

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Nonnegative matrix factorization: an analytical and interpretive tool in computational biology.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS Computational Biology

Lead the way for us

Similar Papers

Advances in Nonnegative Matrix and Tensor Factorization
A Cichocki ... P Smaragdis
Computational Intelligence and Neuroscience | VOL. 2008
A Cichocki, et. al.A Cichocki ... P Smaragdis
01 Jan 2008
Computational Intelligence and Neuroscience | VOL. 2008

On nonnegative matrix factorization algorithms for signal-dependent noise with application to electromyography data.
Karthik Devarajan ... Vincent C K Cheung
Neural Computation | VOL. 26
Karthik Devarajan, et. al.Karthik Devarajan ... Vincent C K Cheung
31 Mar 2014
Neural Computation | VOL. 26

Application of non-negative and local non negative matrix factorization to facial expression recognition
...
-
, et. al. ...
23 Aug 2004
23 Aug 2004

Matrix and Tensor Decompositions
Karthik Devarajan
-
Karthik DevarajanKarthik Devarajan
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Nonnegative matrix factorization: an analytical and interpretive tool in computational biology.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS Computational Biology