Unbiased Estimation for Linear Regression When n &lt; v

Saeed Aldahmani,Hongsheng Dai

doi:10.5539/ijsp.v4n3p61

Unbiased Estimation for Linear Regression When n < v

Saeed Aldahmani, Hongsheng Dai

Open Access

https://doi.org/10.5539/ijsp.v4n3p61

Copy DOI

Abstract

In this paper a new method is proposed for solving the linear regression problem when the number of observations $n$ is smaller than the number of predictors v. This method uses the idea of graphical models and provides unbiased parameter estimates under certain conditions, while existing methods such as ridge regression, LASSO and least angle regression (LARS) give biased estimates. Also the new method can provide a detailed graphical correlation structure for the predictors, therefore the real causal relationship between predictors and response could be identified. In contrast, existing methods often cannot identify the real important predictors which have possible causal effects on the response variable. Unlike the existing methods based on graphical models, the proposed method can identify the potential networks while doing regression even if the data do not follow a multivariate distribution. The new method is compared with some existing methods such as ridge regression, LASSO and LARS by using simulated and real data sets. Our experiments reveal that the new method outperforms all the other methods when n<v.

Highlights

Consider a linear regression model with a univariate response, v covariates and n independent and identically distributed (i.i.d.) observations
When n < v, many methods have been proposed for the above models, such as Least Absolute Shrinkage and Selection Operator (LASSO) (Tibshirani, 1996), Least Angle Regression (LARS) (Efron et al, 2004) and ridge regression (Hoerl & Kennard, 1970)
We provide detailed simulation study to show that our GLSE has much smaller bias than other existing methods such as least absolute shrinkage and selection operator (LASSO), ridge regression and least angle regression (LARS)

Summary

Introduction

Consider a linear regression model with a univariate response, v covariates and n independent and identically distributed (i.i.d.) observations. The selected model based on LASSO and LARS can take at most n covariates (Zou & Hastie, 2005; McCann & Welsch, 2007) This will be problematic in some areas where more or even all covariates have to be included in the model. Ridge regression can include all covariates in the model, but the biased estimate makes it difficult to justify the significance levels for each covariate. This can lead to a non-sparse model which is difficult to interpret when the number of features is large (Yuan et al, 2007). Their estimates are still biased which might not be recommended in general (Washington et al, 2010; Zhang, 2010)

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Statistics and Probability	Publication Date: Jul 1, 2015
Citations: 15	License type: cc-by

R Discovery Prime

R Discovery Prime

Unbiased Estimation for Linear Regression When n < v

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Statistics and Probability

Lead the way for us

Similar Papers

Pathwise least angle regression and a significance test for the elastic net
Muhammad Naveed Tabassum ... Esa Ollila
-
Muhammad Naveed Tabassum, et. al.Muhammad Naveed Tabassum ... Esa Ollila
01 Aug 2017
01 Aug 2017

Least angle regression, relaxed lasso, and elastic net for algebraic multigrid of systems of elliptic partial differential equations
Barry Lee
Numerical Linear Algebra with Applications | VOL. -
Barry LeeBarry Lee
18 Jun 2024
Numerical Linear Algebra with Applications | VOL. -

Conjugate Direction Boosting
Roman Werner Lutz ... Peter Bühlmann
Journal of Computational and Graphical Statistics | VOL. 15
Roman Werner Lutz, et. al.Roman Werner Lutz ... Peter Bühlmann
01 Jun 2006
Journal of Computational and Graphical Statistics | VOL. 15

Model selection procedure for high‐dimensional data
Yongli Zhang ... Xiaotong Shen
Statistical Analysis and Data Mining: The ASA Data Science Journal | VOL. 3
Yongli Zhang, et. al.Yongli Zhang ... Xiaotong Shen
08 Sep 2010
Statistical Analysis and Data Mining: The ASA Data Science Journal | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Unbiased Estimation for Linear Regression When n &lt; v

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Statistics and Probability

Unbiased Estimation for Linear Regression When n < v