Abstract

We propose a Bayesian methodology for estimating spiked covariance matrices with a jointly sparse structure in high dimensions. The spiked covariance matrix is reparameterized in terms of the latent factor model, where the loading matrix is equipped with a novel matrix spike-and-slab LASSO prior, which is a continuous shrinkage prior for modeling jointly sparse matrices. We establish the rate-optimal posterior contraction for the covariance matrix with respect to the spectral norm as well as that for the principal subspace with respect to the projection spectral norm loss. We also study the posterior contraction rate of the principal subspace with respect to the two-to-infinity norm loss, a novel loss function measuring the distance between subspaces that is able to capture entrywise eigenvector perturbations. We show that the posterior contraction rate with respect to the two-to-infinity norm loss is tighter than that with respect to the routinely used projection spectral norm loss under certain low-rank and bounded coherence conditions. In addition, a point estimator for the principal subspace is proposed with the rate-optimal risk bound with respect to the projection spectral norm loss. The numerical performance of the proposed methodology is assessed through synthetic examples and the analysis of a real-world face data example.

Highlights

  • In contemporary statistics, datasets are typically collected with high-dimensionality, where the dimension p can be significantly larger than the sample size n

  • The major contribution of this work is two-fold: Firstly, we establish the rate-optimal posterior contraction for the entire covariance matrix Σ with respect to the spectral norm loss as well as that for the principal subspace with respect to the projection spectral norm loss; Secondly, we focus on the two-to-infinity norm loss, a novel loss function measuring the entrywise behavior between linear subspaces

  • We present a collection of theoretical properties of the matrix spike-and-slab LASSO prior that are useful for deriving posterior contraction under the spiked covariance matrix models and may be of independent interest for other statistical tasks, e.g., sparse Bayesian linear regression with multivariate response (Ning and Ghosal, 2018)

Read more

Summary

Introduction

1996; Bernardo et al, 2003), Gaussian graphical models (Wainwright and Jordan, 2008; Liu et al, 2012), etc. Pati et al (2014), Gao and Zhou (2015), and Ning (2021) explore the posterior contraction rates for Bayesian estimation of sparse spiked covariance models. The major contribution of this work is two-fold: Firstly, we establish the rate-optimal posterior contraction for the entire covariance matrix Σ with respect to the spectral norm loss as well as that for the principal subspace with respect to the projection spectral norm loss; Secondly, we focus on the two-to-infinity norm loss, a novel loss function measuring the entrywise behavior between linear subspaces. The density of Π with respect to the underlying sigma-finite measure is denoted by π

Background and loss functions
A continuous matrix shrinkage prior for joint sparsity
Properties of the matrix spike-and-slab LASSO prior
Posterior contraction for the sparse Bayesian spiked covariance model
Synthetic examples
A face data example
Discussion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call