Contaminated Chi-Square Modeling and Large-Scale ANOVA Testing

Richard Charnigo

doi:10.4172/2155-6180.1000157

Abstract

We propose a convenient moment-based procedure for testing the omnibus null hypothesis of no contamination of a central chi-square distribution by a non-central chi-square distribution. In sharp contrast with likelihood ratio tests for mixture models, there is no need for re-sampling or random field theory to obtain critical values. Rather, critical values are available from an asymptotic normal distribution, and there is excellent agreement between nominal and actual significance levels. This procedure may be used to model numerous chi-square statistics, obtained via monotonic transformations of F statistics, from large-scale ANOVA testing, such as that encountered in microarray data analysis. In that context, modeling chi-square statistics instead of p-values may improve detection of differential gene expression, as we demonstrate through simulation studies, while also reducing false declarations of differential expression, as we illustrate in a case study on aging and cognition. Our procedure may also be incorporated into a gene filtration process, which may reduce Type II errors on genewise null hypotheses by justifying lighter controls for Type I errors.

Highlights

Consider the mixture model [1,2,3] with probability density function $(1-\lambda)\,\chi^2_\nu(0) + \lambda\,\chi^2_\nu(\mu)$ (1), where $0 \le \lambda \le 1$, $\chi^2_\nu(0)$ denotes the central chi-square pdf on $\nu > 0$ degrees of freedom (df), and $\chi^2_\nu(\mu)$ denotes the chi-square pdf on $\nu$ df with non-centrality parameter $\mu \ge 0$.
Employing the Contaminated Chi-square (CCS) model to analyze chi-square statistics, instead of the Contaminated Beta (CB) model to assess p-values, resolves the aforementioned concern, because the omnibus null hypothesis from (2) is not rejected for the genes eliminated in step 3.
We have developed a convenient procedure for testing the omnibus null hypothesis of no contamination of a central chi-square distribution by a non-central chi-square distribution.

Summary

Introduction

Consider the mixture model [1,2,3] with probability density function $(1-\lambda)\,\chi^2_\nu(0) + \lambda\,\chi^2_\nu(\mu)$ (1), where $0 \le \lambda \le 1$, $\chi^2_\nu(0)$ denotes the central chi-square pdf on $\nu > 0$ degrees of freedom (df), and $\chi^2_\nu(\mu)$ denotes the chi-square pdf on $\nu$ df with non-centrality parameter $\mu \ge 0$. To understand how this contaminated chi-square (CCS) model and the omnibus null hypothesis relate to large-scale ANOVA testing, suppose that a microarray experiment [4,5] is performed to measure expression levels on each of $n$ genes for subjects in independent samples of sizes $g_1, g_2, \ldots, g_K$ from $K$ populations. Letting $\lambda$ denote the proportion of genes for which mean expression levels are not equal across the $K$ populations, we may regard the collection of rescaled test statistics $X_1, X_2, \ldots, X_n$ as a sample from the CCS model with $\nu = K-1$. If mean expression levels are equal across the $K$ populations for all genes, the CCS model reduces to $\chi^2_{K-1}(0)$. This is why $\lambda\mu = 0$ is referred to as the omnibus null hypothesis. An appendix explains the rescaling of F statistics into approximate chi-square statistics.
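To make the CCS model concrete, the following is a minimal Python sketch that simulates statistics from the mixture and computes a naive first-moment z statistic. It is not the moment-based procedure proposed in the paper, whose test statistic is not given in this excerpt. The sketch relies only on two standard facts: a non-central chi-square variable on $\nu \ge 1$ df can be written as $(Z + \sqrt{\mu})^2$ plus an independent central chi-square on $\nu - 1$ df, and the CCS mean is $\nu + \lambda\mu$, which equals $\nu$ exactly when $\lambda\mu = 0$. The function names `sample_ccs` and `first_moment_z`, the sample size, and the parameter values are all hypothetical choices for illustration.

```python
import math
import random


def sample_ccs(n, nu, lam, mu, rng):
    """Draw n values from (1 - lam) * chi2_nu(0) + lam * chi2_nu(mu).

    Assumes nu >= 1 (in the ANOVA setting, nu = K - 1).
    """
    if nu < 1:
        raise ValueError("this sketch requires nu >= 1")
    draws = []
    for _ in range(n):
        # With probability lam the statistic comes from the non-central component.
        shift = math.sqrt(mu) if rng.random() < lam else 0.0
        # chi2_nu(mu) = (Z + sqrt(mu))^2 + chi2_{nu-1}(0), with Z standard normal.
        value = (rng.gauss(0.0, 1.0) + shift) ** 2
        if nu > 1:
            # A central chi-square on k df is Gamma(shape=k/2, scale=2).
            value += rng.gammavariate((nu - 1) / 2.0, 2.0)
        draws.append(value)
    return draws


def first_moment_z(x, nu):
    """Naive z statistic: under lam * mu = 0 each X_i has mean nu and variance 2 * nu."""
    n = len(x)
    mean = sum(x) / n
    return math.sqrt(n) * (mean - nu) / math.sqrt(2.0 * nu)


rng = random.Random(0)
nu = 3  # e.g. K = 4 populations (illustrative value)

x_null = sample_ccs(20000, nu, lam=0.0, mu=0.0, rng=rng)  # no contamination
x_alt = sample_ccs(20000, nu, lam=0.1, mu=4.0, rng=rng)   # 10% of genes shifted

z_null = first_moment_z(x_null, nu)
z_alt = first_moment_z(x_alt, nu)
print("z under the omnibus null:", round(z_null, 2))
print("z under contamination:   ", round(z_alt, 2))
```

Because contamination can only raise the mean, a one-sided comparison of the z statistic with an upper standard normal quantile (for example 1.645 at the 5% level) is the natural reading of this sketch. A test built on the first moment alone is only an illustration of how normal critical values can arise without re-sampling.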

Background on Mixture Modeling

Findings

Discussion

Journal: Journal of Biometrics & Biostatistics
Publication Date: Jan 1, 2013
License type: CC BY
