Abstract
Although the operator (spectral) norm is one of the most widely used metrics for covariance estimation, comparatively little is known about the fluctuations of error in this norm. To be specific, let Σˆ denote the sample covariance matrix of n i.i.d. observations in Rp that arise from a population matrix Σ, and let Tn=n‖Σˆ−Σ‖ op. In the setting where the eigenvalues of Σ have a decay profile of the form λj(Σ)≍j−2β, we analyze how well the bootstrap can approximate the distribution of Tn. Our main result shows that up to factors of log(n), the bootstrap can approximate the distribution of Tn with respect to the Kolmogorov metric at the rate of n−β−1∕2 6β+4, which does not depend on the ambient dimension p. In addition, we offer a supporting result of independent interest that establishes a high-probability upper bound for Tn based on flexible moment assumptions. More generally, we discuss the consequences of our work beyond covariance matrices, and show how the bootstrap can be used to estimate the errors of sketching algorithms in randomized numerical linear algebra (RandNLA). An illustration of these ideas is also provided with a climate data example.
Submitted Version (Free)
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have