Abstract

In this work, we propose a scalable Bayesian procedure for learning the local dependence structure in a high-dimensional model where the variables possess a natural ordering. The ordering of variables can be indexed by time, the vicinities of spatial locations, and so on, with the natural assumption that variables far apart tend to have weak correlations. Applications of such models abound in a variety of fields such as finance, genome associations analysis and spatial modeling. We adopt a flexible framework under which each variable is dependent on its neighbors or predecessors, and the neighborhood size can vary for each variable. It is of great interest to reveal this local dependence structure by estimating the covariance or precision matrix while yielding a consistent estimate of the varying neighborhood size for each variable. The existing literature on banded covariance matrix estimation, which assumes a fixed bandwidth cannot be adapted for this general setup. We employ the modified Cholesky decomposition for the precision matrix and design a flexible prior for this model through appropriate priors on the neighborhood sizes and Cholesky factors. The posterior contraction rates of the Cholesky factor are derived which are nearly or exactly minimax optimal, and our procedure leads to consistent estimates of the neighborhood size for all the variables. Another appealing feature of our procedure is its scalability to models with large numbers of variables due to efficient posterior inference without resorting to MCMC algorithms. Numerical comparisons are carried out with competitive methods, and applications are considered for some real datasets.

Highlights

  • The problem of covariance matrix or precision matrix estimation has been extensively studied over the last few decades

  • We focus on investigating the local dependence structure in a highdimension model where the variables possess a natural ordering

  • We show that LANCE prior accurately unravels the local dependence structure

Read more

Summary

Introduction

The problem of covariance matrix or precision matrix estimation has been extensively studied over the last few decades. The dependence structure among the variables is encoded by the covariance matrix Σ or its inverse Ω = Σ−1. In high-dimensional settings where p can be much larger than the sample size, the traditional sample covariance matrix or inverse-Wishart prior leads to inconsistent estimates of Σ or Ω, see Johnstone and Lu. Scalable Bayesian Local Dependence Learning (2009) and Lee and Lee (2018). Restricted matrix classes with a banded or sparse structure are often imposed on the covariance or precision matrices as a common practice for consistent estimation (see, e.g., Bickel and Levina, 2008, Cai et al, 2016 and Lee et al, 2019). We focus on investigating the local dependence structure in a highdimension model where the variables possess a natural ordering. The existing literature on banded covariance matrix estimation which deals with a fixed bandwidth cannot be adapted for this set up

Objectives
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call