Minimax Rate Research Articles

We introduce a new framework for estimation of sparse normal means, bridging the gap between popular frequentist strategies (LASSO) and popular Bayesian strategies (spike-and-slab). The main thrust of this paper is to introduce the family of Spike-and-Slab LASSO (SS-LASSO) priors, which form a continuum between the Laplace prior and the point-mass spike-and-slab prior. We establish several appealing frequentist properties of SS-LASSO priors, contrasting them with these two limiting cases. First, we adopt the penalized likelihood perspective on Bayesian modal estimation and introduce the framework of Bayesian penalty mixing with spike-and-slab priors. We show that the SS-LASSO global posterior mode is (near) minimax rate-optimal under squared error loss, similarly as the LASSO. Going further, we introduce an adaptive two-step estimator which can achieve provably sharper performance than the LASSO. Second, we show that the whole posterior keeps pace with the global mode and concentrates at the (near) minimax rate, a property that is known \textsl{not to hold} for the single Laplace prior. The minimax-rate optimality is obtained with a suitable class of independent product priors (for known levels of sparsity) as well as with dependent mixing priors (adapting to the unknown levels of sparsity). Up to now, the rate-optimal posterior concentration has been established only for spike-and-slab priors with a point mass at zero. Thus, the SS-LASSO priors, despite being continuous, possess similar optimality properties as the “theoretically ideal” point-mass mixtures. These results provide valuable theoretical justification for our proposed class of priors, underpinning their intuitive appeal and practical potential.

Read full abstract

We study asymptotic optimality of inference in a high-dimensional sparse normal means model using a broad class of one-group shrinkage priors. Assuming that the proportion of non-zero means is known, we show that the corresponding Bayes estimates asymptotically attain the minimax risk (up to a multiplicative constant) for estimation with squared error loss. The constant is shown to be 1 for the important sub-class of “horseshoe-type” priors proving exact asymptotic minimaxity property for these priors, a result hitherto unknown in the literature. An empirical Bayes version of the estimator is shown to achieve the minimax rate in case the level of sparsity is unknown. We prove that the resulting posterior distributions contract around the true mean vector at the minimax optimal rate and provide important insight about the possible rate of posterior contraction around the corresponding Bayes estimator. Our work shows that for rate optimality, a heavy tailed prior with sufficient mass around zero is enough, a pole at zero like the horseshoe prior is not necessary. This part of the work is inspired by Pas et al. (2014). We come up with novel unifying arguments to extend their results over the general class of priors. Next we focus on simultaneous hypothesis testing for the means under the additive 0−1 loss where the means are modeled through a two-groups mixture distribution. We study asymptotic risk properties of certain multiple testing procedures induced by the class of one-group priors under study, when applied in this set-up. Our key results show that the tests based on the “horseshoe-type” priors asymptotically achieve the risk of the optimal solution in this two-groups framework up to the correct constant and are thus asymptotically Bayes optimal under sparsity (ABOS). This is the first result showing that in a sparse problem a class of one-group priors can exactly mimic the performance of an optimal two-groups solution asymptotically. Our work shows an intrinsic technical connection between the theories of minimax estimation and simultaneous hypothesis testing for such one-group priors.

Read full abstract

Minimax Rate Research Articles

Related Topics

Articles published on Minimax Rate

Bayesian Regression Tree Ensembles that Adapt to Smoothness and Sparsity

Minimax Optimal Convex Methods for Poisson Inverse Problems Under <inline-formula> <tex-math notation="LaTeX">$\ell_{q}$ </tex-math> </inline-formula>-Ball Sparsity

Fast Gaussian Process Regression for Big Data

Variational multiscale nonparametric regression: Smooth functions

On matrix estimation under monotonicity constraints

Robust functional estimation in the multivariate partial linear model

A frequency domain analysis of the error distribution from noisy high-frequency data

Denoising Flows on Trees

Optimal sup-norm rates and uniform inference on nonlinear functionals of nonparametric IV regression

Remember the curse of dimensionality: the case of goodness-of-fit testing in arbitrary dimension

Optimal bounds for aggregation of affine estimators

Bayesian estimation of sparse signals with a continuous spike-and-slab prior

Minimax lower bounds for function estimation on graphs

Non-parametric estimation of time varying AR(1)–processes with local stationarity and periodicity

Improved bounds for Square-Root Lasso and Square-Root Slope

Empirical Bayes analysis of spike and slab posterior distributions

A change-point problem and inference for segment signals

Bounds on the minimax rate for estimating a prior over a VC class from independent learning tasks

Asymptotic Optimality of One-Group Shrinkage Priors in Sparse High-dimensional Problems

Minimax wavelet estimation for multisample heteroscedastic nonparametric regression

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Minimax Rate Research Articles

Related Topics

Articles published on Minimax Rate

Bayesian Regression Tree Ensembles that Adapt to Smoothness and Sparsity

Minimax Optimal Convex Methods for Poisson Inverse Problems Under &lt;inline-formula&gt; &lt;tex-math notation="LaTeX"&gt;$\ell_{q}$ &lt;/tex-math&gt; &lt;/inline-formula&gt;-Ball Sparsity

Fast Gaussian Process Regression for Big Data

Variational multiscale nonparametric regression: Smooth functions

On matrix estimation under monotonicity constraints

Robust functional estimation in the multivariate partial linear model

A frequency domain analysis of the error distribution from noisy high-frequency data

Denoising Flows on Trees

Optimal sup-norm rates and uniform inference on nonlinear functionals of nonparametric IV regression

Remember the curse of dimensionality: the case of goodness-of-fit testing in arbitrary dimension

Optimal bounds for aggregation of affine estimators

Bayesian estimation of sparse signals with a continuous spike-and-slab prior

Minimax lower bounds for function estimation on graphs

Non-parametric estimation of time varying AR(1)–processes with local stationarity and periodicity

Improved bounds for Square-Root Lasso and Square-Root Slope

Empirical Bayes analysis of spike and slab posterior distributions

A change-point problem and inference for segment signals

Bounds on the minimax rate for estimating a prior over a VC class from independent learning tasks

Asymptotic Optimality of One-Group Shrinkage Priors in Sparse High-dimensional Problems

Minimax wavelet estimation for multisample heteroscedastic nonparametric regression

Minimax Optimal Convex Methods for Poisson Inverse Problems Under <inline-formula> <tex-math notation="LaTeX">$\ell_{q}$ </tex-math> </inline-formula>-Ball Sparsity