Abstract

This work is concerned with testing the population mean vector of nonnormal high-dimensional multivariate data. Several tests for high-dimensional mean vector, based on modifying the classical Hotelling T2 test, have been proposed in the literature. Despite their usefulness, they tend to have unsatisfactory power performance for heavy-tailed multivariate data, which frequently arise in genomics and quantitative finance. This article proposes a novel high-dimensional nonparametric test for the population mean vector for a general class of multivariate distributions. With the aid of new tools in modern probability theory, we proved that the limiting null distribution of the proposed test is normal under mild conditions when p is substantially larger than n. We further study the local power of the proposed test and compare its relative efficiency with a modified Hotelling T2 test for high-dimensional data. An interesting finding is that the newly proposed test can have even more substantial power gain with large p than the traditional nonparametric multivariate test does with finite fixed p. We study the finite sample performance of the proposed test via Monte Carlo simulations. We further illustrate its application by an empirical analysis of a genomics dataset. Supplementary materials for this article are available online.

Highlights

  • Let X1, ..., Xn be independent and identically distributed p-dimensional random vectors from the model Xi = μ + εi where εi is the random error to be specified later

  • The theory established in this paper suggests that when p > n, the asymptotic relative efficiency of the proposed new nonparametric test versus Chen and Qin’s extension of Hotelling’s T2 test is about 2.54

  • The gene set SEMENZA_HIF1_TARGETS is only on the top ten list of the new test and was found to be biologically related to insulin effect on human cells. Most of those significant gene sets are induced by Epidermal growth factor (EGF) or insulin-like growth factor (IGF)

Read more

Summary

A High-Dimensional Nonparametric Multivariate Test for Mean Vector

Bo Peng, and Graduate student, School of Statistics, University of Minnesota, Minneapolis, MN 55455. Runze Li Distinguished Professor, Department of Statistics and the Methodology Center, the Pennsylvania State University, University Park, PA 16802-2111

Introduction
A high-dimensional nonparametric test
The limiting null distribution
Local power analysis
Monte Carlo simulations
An application
Findings
Conclusion and discussions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call