Measuring similarity between gene expression profiles: a Bayesian approach

Viet-Anh Nguyen,Pietro Lió

doi:10.1186/1471-2164-10-s3-s14

Abstract

BackgroundGrouping genes into clusters on the basis of similarity between their expression profiles has been the main approach to predict functional modules, from which important inference or further investigation decision could be made. While the univocal determination of similarity metric is important, current practices are normally involved with Euclidean distance and Pearson correlation, of which assumptions are not likely the case for high-throughput microarray data.ResultsWe advocate the use of a novel metric - BayesGen - to measure similarity between gene expression profiles, and demonstrate its performance on two important applications: constructing genome-wide co-expression network, and clustering cancer human tissues into subtypes. BayesGen is formulated as the evidence ratio between two alternative hypotheses about the generating mechanism of a given pair of genes, and incorporates as prior knowledge the global characteristics of the whole dataset. Through the joint modelling of expected intensity levels and noise variances, it addresses the inherent nonlinearity and the association of noise levels across different microarray value ranges. The full Bayesian formulation also facilitates the possibility of meta-analysis.ConclusionBayesGen allows more effective extraction of similarity information between genes from microarray expression data, which has significant effect on various inference tasks. It also provides a robust choice for other object-feature data, as illustrated through the results of the test on synthetic data.

Highlights

Grouping genes into clusters on the basis of similarity between their expression profiles has been the main approach to predict functional modules, from which important inference or further investigation decision could be made
We suggested the use of BayesGen - a new metric for measuring similarity/distance between gene expression profiles
Based on the observation that both data points' intensity levels and their relative variance jointly contribute to the identification of the underlying cellular processes, the metric was derived using a full Bayesian approach, which incorporates as prior knowledge the global characteristics of the whole dataset

Summary

Introduction

Grouping genes into clusters on the basis of similarity between their expression profiles has been the main approach to predict functional modules, from which important inference or further investigation decision could be made. While the univocal determination of similarity metric is important, current practices are normally involved with Euclidean distance and Pearson correlation, of which assumptions are not likely the case for high-throughput microarray data. With the development of high-throughput experimental techniques, biological research have been transformed into a data-rich discipline. The analysis of microarray data is normally based on the reasoning that variations in gene expression patterns under different experimental conditions are the (page number not for citation purposes). By grouping genes with similar expression profiles into clusters, researchers have been able to make meaningful inference of regulatory modules and functional pathways [2]

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Genomics	Publication Date: Jan 1, 2009
Citations: 35	License type: cc-by

R Discovery Prime

R Discovery Prime

Measuring similarity between gene expression profiles: a Bayesian approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics

Lead the way for us

Similar Papers

Prognostic Gene Expression Signatures Can Be Measured in Tissues Collected in RNAlater Preservative
Dondapati Chowdary ... Abhijit Mazumder
The Journal of Molecular Diagnostics | VOL. 8
Dondapati Chowdary, et. al.Dondapati Chowdary ... Abhijit Mazumder
01 Feb 2006
The Journal of Molecular Diagnostics | VOL. 8

Successful Application of Microarray Technology to Microdissected Formalin-Fixed, Paraffin-Embedded Tissue
Renata A Coudry ... Margie L Clapper
The Journal of Molecular Diagnostics | VOL. 9
Renata A Coudry, et. al.Renata A Coudry ... Margie L Clapper
01 Feb 2007
The Journal of Molecular Diagnostics | VOL. 9

Global analysis of gene function in yeast by quantitative phenotypic profiling
James A Brown ... Nicola M Burrows
Molecular Systems Biology | VOL. 2
James A Brown, et. al.James A Brown ... Nicola M Burrows
01 Jan 2006
Molecular Systems Biology | VOL. 2

Gene Expression Analysis of Immune-Mediated Arrest of Tumorigenesis in a Transgenic Mouse Model of HER-2/neu-Positive Basal-Like Mammary Carcinoma
Annalisa Astolfi ... Pier-Luigi Lollini
The American Journal of Pathology | VOL. 166
Annalisa Astolfi, et. al.Annalisa Astolfi ... Pier-Luigi Lollini
01 Apr 2005
The American Journal of Pathology | VOL. 166

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Measuring similarity between gene expression profiles: a Bayesian approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics