Abstract

We study the asymptotic properties of Lasso+mLS and Lasso+Ridge under the sparse high-dimensional linear regression model: the Lasso selects predictors and then modified Least Squares (mLS) or Ridge estimates their coefficients. First, we propose a valid inference procedure for parameter estimation based on the parametric residual bootstrap after Lasso+mLS and Lasso+Ridge. Second, we derive the asymptotic unbiasedness of Lasso+mLS and Lasso+Ridge. More specifically, we show that their biases decay at an exponential rate and that they can achieve the oracle convergence rate of $s/n$ (where $s$ is the number of nonzero regression coefficients and $n$ is the sample size) in mean squared error (MSE). Third, we show that Lasso+mLS and Lasso+Ridge are asymptotically normal. They have an oracle property in the sense that they can select the true predictors with probability converging to $1$, and the estimates of the nonzero parameters have the same asymptotic normal distribution that they would have if the zero parameters were known in advance. In fact, our analysis is not limited to adopting the Lasso in the selection stage, but is applicable to any other model selection criterion whose probability of selecting a wrong model decays at an exponential rate.
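
The two-stage procedure can be sketched as follows. This is a minimal illustration, not the paper's implementation: scikit-learn is used only for convenience, the tuning parameters `lasso_alpha` and `ridge_alpha` are hypothetical choices, and the "modified" step of mLS (a safeguard for ill-conditioned selected designs) is replaced here by plain least squares on the selected support.

```python
import numpy as np
from sklearn.linear_model import Lasso, LinearRegression, Ridge

def lasso_then_refit(X, y, lasso_alpha=0.1, refit="mls", ridge_alpha=1.0):
    """Stage 1: Lasso selects a support. Stage 2: re-estimate the selected
    coefficients by least squares (stand-in for mLS) or by Ridge."""
    lasso = Lasso(alpha=lasso_alpha, fit_intercept=False).fit(X, y)
    support = np.flatnonzero(lasso.coef_)        # indices of selected predictors
    beta = np.zeros(X.shape[1])
    if support.size == 0:                        # Lasso selected nothing
        return beta, support
    if refit == "mls":
        model = LinearRegression(fit_intercept=False)
    else:
        model = Ridge(alpha=ridge_alpha, fit_intercept=False)
    model.fit(X[:, support], y)
    beta[support] = model.coef_                  # coefficients stay zero outside the support
    return beta, support
```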

Highlights

  • Consider the sparse linear regression model $Y = X\beta^* + \epsilon$, where $\epsilon = (\epsilon_1, \ldots, \epsilon_n)^T$ is a vector of independent and identically distributed (i.i.d.) random variables with mean $0$ and variance $\sigma^2$ (a data-generation sketch is given after this list)

  • As we show in Theorem 3 and Corollary 2, these two post-Lasso estimators display an oracle property that the Lasso does not have: they can select the true predictors with probability converging to 1 and the estimates of nonzero parameters have the same asymptotic normal distribution that they would have if the zero parameters were known in advance

  • We have derived for the first time the asymptotic properties of Lasso+modified Least Squares (mLS) and Lasso+Ridge in sparse high-dimensional linear regression models where p ≫ n
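
A minimal data-generation sketch for the model in the first highlight. The Gaussian design, Gaussian noise, and the values of n, p, s and of the nonzero coefficients are illustrative assumptions; the paper only requires i.i.d. errors with mean 0 and variance $\sigma^2$.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p, s = 100, 500, 5                      # sample size, dimension (p >> n), sparsity
beta_star = np.zeros(p)
beta_star[:s] = rng.uniform(1.0, 2.0, s)   # illustrative nonzero coefficients
X = rng.standard_normal((n, p))            # illustrative Gaussian design
sigma = 1.0
eps = sigma * rng.standard_normal(n)       # i.i.d. errors, mean 0, variance sigma^2
y = X @ beta_star + eps
```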

Summary

Introduction

Thresholded Lasso and Dantzig estimators were introduced in [31], where the authors proved their model selection consistency under less restrictive conditions on the decay rates of the nonzero regression coefficients. We should mention that previous work [3] obtained the $\ell_2$ convergence rate $\|\hat{\beta}_{\mathrm{Lasso+OLS}} - \beta^*\|_2^2 = O_p(s/n)$ of the Lasso+OLS estimator under weaker conditions. However, their results hold in probability, and it is not clear whether Lasso+OLS can achieve the oracle convergence rate of $O(s/n)$ in $L_2$-expectation, i.e., whether $E\|\hat{\beta} - \beta^*\|_2^2 = O(s/n)$ holds, which we need in order to prove the validity of the residual bootstrap. We begin with a precise definition of the modified Least Squares or Ridge estimator after model selection, and study their asymptotic properties, including asymptotic unbiasedness, asymptotic normality and the validity of the residual bootstrap.
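
Below is a minimal sketch of the residual bootstrap after such a two-stage fit, assuming an `estimator(X, y)` callable that returns a length-$p$ coefficient vector (for instance `lambda X, y: lasso_then_refit(X, y)[0]` from the sketch above). The centering and the exact parametric variant analyzed in the paper may differ from this simple resampling of centred residuals.

```python
import numpy as np

def residual_bootstrap(X, y, estimator, B=500, rng=None):
    """Refit the estimator on B responses regenerated from centred residuals."""
    rng = np.random.default_rng() if rng is None else rng
    n, p = X.shape
    beta_hat = estimator(X, y)                  # e.g. a Lasso+mLS or Lasso+Ridge fit
    resid = y - X @ beta_hat
    resid = resid - resid.mean()                # centre the residuals
    boot = np.empty((B, p))
    for b in range(B):
        eps_star = rng.choice(resid, size=n, replace=True)
        boot[b] = estimator(X, X @ beta_hat + eps_star)
    return beta_hat, boot                       # bootstrap quantiles give confidence intervals
```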

Definitions and assumptions
[Partitioned matrix with blocks $C_{11}$, $C_{12}$, $C_{21}$, $C_{22}$]
Simulation
Finite sample distribution
Confidence intervals and coverage probabilities
Findings
Conclusion