Assessing Differential Expression in Two-Color Microarrays: A Resampling-Based Empirical Bayes Approach

Dongmei Li,Timothy D Dye,Will X Chen,Nisha I Parikh,Marc A Le Pape

doi:10.1371/journal.pone.0080099

Dongmei Li, Timothy D Dye + Show 3 more

Open Access

https://doi.org/10.1371/journal.pone.0080099

Copy DOI

Abstract

Microarrays are widely used for examining differential gene expression, identifying single nucleotide polymorphisms, and detecting methylation loci. Multiple testing methods in microarray data analysis aim at controlling both Type I and Type II error rates; however, real microarray data do not always fit their distribution assumptions. Smyth's ubiquitous parametric method, for example, inadequately accommodates violations of normality assumptions, resulting in inflated Type I error rates. The Significance Analysis of Microarrays, another widely used microarray data analysis method, is based on a permutation test and is robust to non-normally distributed data; however, the Significance Analysis of Microarrays method fold change criteria are problematic, and can critically alter the conclusion of a study, as a result of compositional changes of the control data set in the analysis. We propose a novel approach, combining resampling with empirical Bayes methods: the Resampling-based empirical Bayes Methods. This approach not only reduces false discovery rates for non-normally distributed microarray data, but it is also impervious to fold change threshold since no control data set selection is needed. Through simulation studies, sensitivities, specificities, total rejections, and false discovery rates are compared across the Smyth's parametric method, the Significance Analysis of Microarrays, and the Resampling-based empirical Bayes Methods. Differences in false discovery rates controls between each approach are illustrated through a preterm delivery methylation study. The results show that the Resampling-based empirical Bayes Methods offer significantly higher specificity and lower false discovery rates compared to Smyth's parametric method when data are not normally distributed. The Resampling-based empirical Bayes Methods also offers higher statistical power than the Significance Analysis of Microarrays method when the proportion of significantly differentially expressed genes is large for both normally and non-normally distributed data. Finally, the Resampling-based empirical Bayes Methods are generalizable to next generation sequencing RNA-seq data analysis.

Highlights

Microarray technology is widely used to examine the activity level of thousands of genes simultaneously in human cells to better understand differential gene activation across diseases, such as heart diseases, infectious diseases, mental illness, and health disparities across ethnic groups
They further modified the penalized t-statistics through a parametric empirical Bayes approach using a simple mixture of normal models and a conjugate prior, and showed that their empirical Bayes method had both lower Type I error rates and Type II error rates compared to the penalized t-statistics
To reduce false discovery rates for nonnormally distributed microarray data, we propose a novel approach combining resampling with empirical Bayes methods: Resampling-based empirical Bayes Methods (RBMs)

Summary

Introduction

Microarray technology is widely used to examine the activity level of thousands of genes simultaneously in human cells to better understand differential gene activation across diseases, such as heart diseases, infectious diseases, mental illness, and health disparities across ethnic groups. Genes with small sample variances are more likely to have large t-statistics even when they are not differentially expressed Both Tusher et al [4] and Efron et al [5] modified the ordinary t-statistic to have penalized t-statistics by adding a penalty to the standard deviation. Lonnstedt and Speed [6] showed that both forms of penalized t-statistics were far superior to the ordinary tstatistic for selecting differentially expressed genes They further modified the penalized t-statistics through a parametric empirical Bayes approach using a simple mixture of normal models and a conjugate prior, and showed that their empirical Bayes method had both lower Type I error rates and Type II error rates compared to the penalized t-statistics

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLoS ONE	Publication Date: Nov 27, 2013
Citations: 5	License type: CC BY 3.0

R Discovery Prime

R Discovery Prime

Assessing Differential Expression in Two-Color Microarrays: A Resampling-Based Empirical Bayes Approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE

Lead the way for us

Similar Papers

Significance analysis of microarray for relative quantitation of LC/MS data in proteomics
Bryan Ap Roxas ... Qingbo Li
BMC Bioinformatics | VOL. 9
Bryan Ap Roxas, et. al.Bryan Ap Roxas ... Qingbo Li
10 Apr 2008
BMC Bioinformatics | VOL. 9

A comprehensive evaluation of SAM, the SAM R-package and a simple modification to improve its performance.
Shunpu Zhang
BMC Bioinformatics | VOL. 8
Shunpu ZhangShunpu Zhang
29 Jun 2007
BMC Bioinformatics | VOL. 8

Differential expression of microRNAs in mouse liver under aberrant energy metabolic status
Shengjie Li ... Chen-Yu Zhang
Journal of Lipid Research | VOL. 50
Shengjie Li, et. al.Shengjie Li ... Chen-Yu Zhang
01 Sep 2009
Journal of Lipid Research | VOL. 50

USING WEIGHTED PERMUTATION SCORES TO DETECT DIFFERENTIAL GENE EXPRESSION WITH MICROARRAY DATA
Xu Guo ... Wei Pan
Journal of Bioinformatics and Computational Biology | VOL. 03
Xu Guo, et. al.Xu Guo ... Wei Pan
01 Aug 2005
Journal of Bioinformatics and Computational Biology | VOL. 03

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Assessing Differential Expression in Two-Color Microarrays: A Resampling-Based Empirical Bayes Approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE