Abstract

The Hotelling statistic is used to test the hypothesis about the location parameter of multivariate Gaussian distribution, and it is significantly sensitive to outliers. Also, we cannot calculate it when the sample size is less than the number of variables because this statistic needs the inverse of the covariance matrix, and the sample covariance matrix is singular in high dimensional data. Although a new approach, based on shrinkage estimation, was proposed to solve this singularity problem, this estimator is still sensitive to outliers. On the other hand, a robust one sample Hotelling statistic was proposed by using the minimum covariance determinant (MCD) estimates instead of classical ones. Since the MCD estimates cannot be calculated when this statistic cannot be used in high-dimensional data. This study proposes to use the minimum regularized covariance determinant (MRCD) estimator instead of classical or MCD. The MRCD estimator is a robust location and scatter estimator, which can be calculated in high-dimensional data. We obtain the asymptotic distribution of the proposed test statistic using Monte Carlo simulations and examine the power and robustness properties of the test statistic with simulated datasets. As a result, we show that the approximate distribution of the test statistic is proper, and the proposed robust test statistic can be used to test the hypothesis about the location parameter of contaminated high dimensional data. Finally, we construct an R function in the MVTests package to perform our proposed test statistic.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call