PAC-Bayesian bounds for sparse regression estimation with exponential weights

Pierre Alquier, Karim Lounici

doi:10.1214/11-ejs601

Abstract

We consider the sparse regression model where the number of parameters p is larger than the sample size n. The difficulty in high-dimensional problems is to propose estimators that achieve a good compromise between statistical and computational performance. The Lasso is the solution of a convex minimization problem, hence computable for large values of p. However, stringent conditions on the design are required to establish fast rates of convergence for this estimator. Dalalyan and Tsybakov [17–19] proposed an exponential weights procedure achieving a good compromise between the statistical and computational aspects. This estimator can be computed for reasonably large p and satisfies a sparsity oracle inequality in expectation for the empirical excess risk under only mild assumptions on the design. In this paper, we propose an exponential weights estimator similar to that of [17] but with improved statistical performance. Our main result is a sparsity oracle inequality in probability for the true excess risk.

Highlights

We observe n independent pairs $(X_1, Y_1), \dots, (X_n, Y_n) \in \mathcal{X} \times \mathbb{R}$ such that $Y_i = f(X_i) + W_i$, $1 \le i \le n$ (1.1), where $f : \mathcal{X} \to \mathbb{R}$ is the unknown regression function and the noise variables $W_1, \dots, W_n$ are independent of the design $(X_1, \dots, X_n)$ and such that $\mathbb{E} W_i = 0$ and $\mathbb{E} W_i^2 \le \sigma^2$ for some known $\sigma^2 > 0$ and any $1 \le i \le n$
Dalalyan and Tsybakov [19, 20, 21, 22] propose an exponential weights procedure related to the PAC-Bayesian approach with good statistical and computational performances
We propose to study two exponential weights estimation procedures
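
The observation model (1.1) can be illustrated with a small simulation. The sketch below is only an illustration under assumptions that are not taken from the paper: the regression function is taken to be linear, f(x) = <theta, x>, with a sparse theta, the design is standard Gaussian, and the noise is Gaussian with variance sigma squared. The function name `simulate_sparse_regression` and all parameter values are hypothetical.

```python
import random


def simulate_sparse_regression(n, p, support, sigma, seed=0):
    """Draw (X_i, Y_i), i = 1..n, with Y_i = f(X_i) + W_i.

    Assumptions (not from the paper): f(x) = <theta, x>, theta equals 1
    on `support` and 0 elsewhere, Gaussian design, Gaussian noise.
    """
    rng = random.Random(seed)
    # Sparse parameter vector: only the coordinates in `support` are nonzero.
    theta = [1.0 if j in support else 0.0 for j in range(p)]
    # Random design: n rows, p columns.
    X = [[rng.gauss(0.0, 1.0) for _ in range(p)] for _ in range(n)]
    # Centered noise with variance sigma**2, drawn independently of the design.
    W = [rng.gauss(0.0, sigma) for _ in range(n)]
    Y = [sum(t * x for t, x in zip(theta, row)) + w for row, w in zip(X, W)]
    return X, Y, theta


# High-dimensional regime: more parameters (p = 50) than observations (n = 20).
X, Y, theta = simulate_sparse_regression(n=20, p=50, support={0, 3, 7}, sigma=0.5)
```

With p = 50 and n = 20, ordinary least squares is not uniquely defined, which is the regime where sparsity-exploiting estimators become relevant.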

Summary

Introduction

We observe n independent pairs $(X_1, Y_1), \dots, (X_n, Y_n) \in \mathcal{X} \times \mathbb{R}$ (where $\mathcal{X}$ is any measurable set) such that model (1.1) holds. Dalalyan and Tsybakov [19, 20, 21, 22] propose an exponential weights procedure related to the PAC-Bayesian approach with good statistical and computational performance. They consider a deterministic design and establish their statistical result only for the empirical excess risk rather than the true excess risk R(·) − R(θ). Note that in a work parallel to ours, Rigollet and Tsybakov [46] consider exponentially weighted aggregates with discrete priors and suggest another version of the Metropolis-Hastings algorithm to compute their estimator.
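
To make the idea of exponentially weighted aggregation with a discrete prior concrete, here is a minimal sketch. It is not the estimator studied in the paper: it assumes a finite list of candidate parameter vectors, a linear predictor, the empirical squared-error risk, and a hypothetical temperature parameter `lam`. Each candidate receives a weight proportional to its prior mass times exp(-lam × empirical risk), and the aggregate is the weighted average of the candidates.

```python
import math


def empirical_risk(theta, X, Y):
    """Mean squared error of the linear predictor x -> <theta, x>."""
    residuals = [y - sum(t * x for t, x in zip(theta, row)) for row, y in zip(X, Y)]
    return sum(r * r for r in residuals) / len(Y)


def exponential_weights(candidates, prior, X, Y, lam):
    """Return normalized weights and the aggregated parameter vector.

    Weight of candidate k is proportional to prior[k] * exp(-lam * risk_k).
    `prior` must have strictly positive entries.
    """
    risks = [empirical_risk(th, X, Y) for th in candidates]
    log_w = [math.log(pk) - lam * r for pk, r in zip(prior, risks)]
    shift = max(log_w)  # subtract the max for numerical stability
    w = [math.exp(v - shift) for v in log_w]
    total = sum(w)
    w = [v / total for v in w]
    dim = len(candidates[0])
    aggregate = [sum(wk * th[j] for wk, th in zip(w, candidates)) for j in range(dim)]
    return w, aggregate


# Toy data generated exactly by theta = (1, 0), without noise.
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 1.0]]
Y = [1.0, 0.0, 1.0, 2.0]
candidates = [[0.0, 0.0], [1.0, 0.0], [1.0, 1.0]]
# Hypothetical prior that favors sparse candidates (mass halves per nonzero entry).
prior = [4 / 7, 2 / 7, 1 / 7]
weights, theta_hat = exponential_weights(candidates, prior, X, Y, lam=10.0)
```

In this toy example almost all of the weight concentrates on the candidate (1, 0), which fits the data exactly. When the candidate set is too large to enumerate, the weighted average has to be approximated, for instance by a Metropolis-Hastings sampler as mentioned above.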

Sparsity Oracle Inequality in Expectation

Sparsity Oracle Inequality in Probability

Practical computation of the estimator

Simulations

Proofs

Journal: Electronic Journal of Statistics	Publication Date: Sep 14, 2010
Citations: 54	License type: cc-by