CANONICAL THRESHOLDING FOR NON-SPARSE HIGH-DIMENSIONAL LINEAR REGRESSION.

Igor Silin, Jianqing Fan

doi:10.1214/21-aos2116

Abstract

We consider a high-dimensional linear regression problem. Unlike many papers on the topic, we do not require sparsity of the regression coefficients; instead, our main structural assumption is a decay of the eigenvalues of the covariance matrix of the data. We propose a new family of estimators, called the canonical thresholding estimators, which pick the largest regression coefficients in the canonical form. The estimators admit an explicit form and can be linked to LASSO and Principal Component Regression (PCR). A theoretical analysis is provided for both the fixed design and the random design settings. The resulting bounds on the mean squared error and the prediction error of a specific estimator from the family make it possible to state clear sufficient conditions on the decay of eigenvalues that ensure convergence. In addition, we promote the use of relative errors, which are strongly linked with the out-of-sample R². The study of these relative errors leads to a new concept of joint effective dimension, which incorporates the covariance of the data and the regression coefficients simultaneously and describes the complexity of a linear regression problem. Some minimax lower bounds are established to showcase the optimality of our procedure. Numerical simulations confirm the good performance of the proposed estimators compared with previously developed methods.
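The idea of picking the largest coefficients in the canonical form can be illustrated with a minimal sketch. This is one plausible reading of the abstract, not the authors' exact estimator: it assumes that the canonical form means regressing on the orthonormalised principal directions of the design matrix, and that picking the largest coefficients means hard thresholding at a level `tau`. The function name `canonical_threshold_fit`, the parameter `tau`, the rank tolerance, and the synthetic data are all hypothetical choices made for illustration.

```python
import numpy as np


def canonical_threshold_fit(X, y, tau):
    """Illustrative sketch (an assumption, not the paper's definition):
    hard-threshold regression coefficients in the principal-component basis."""
    n, _ = X.shape
    # Thin SVD: X = U diag(s) Vt. The columns of sqrt(n) * U act as
    # orthonormalised ("canonical") regressors Z with Z.T @ Z / n = I.
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    # Drop numerically zero singular values so the back-transform is stable.
    keep_sv = s > 1e-10 * s.max()
    U, s, Vt = U[:, keep_sv], s[keep_sv], Vt[keep_sv]
    # Least-squares coefficients in the canonical basis: theta = Z.T @ y / n.
    theta = U.T @ y / np.sqrt(n)
    # Hard thresholding: keep only coefficients larger than tau in magnitude.
    theta_kept = np.where(np.abs(theta) > tau, theta, 0.0)
    # Map back to the original coordinates: beta = V diag(sqrt(n) / s) theta.
    beta = Vt.T @ (np.sqrt(n) * theta_kept / s)
    return beta, theta_kept


# Synthetic example: feature scales decay as 1/j, coefficients are dense.
rng = np.random.default_rng(0)
n, d = 200, 50
scales = 1.0 / np.arange(1, d + 1)
X = rng.standard_normal((n, d)) * scales
beta_true = rng.standard_normal(d)
y = X @ beta_true + 0.1 * rng.standard_normal(n)

beta_hat, theta_kept = canonical_threshold_fit(X, y, tau=0.05)
print("components kept:", np.count_nonzero(theta_kept), "of", d)
```

In this sketch, setting `tau=0` recovers the minimum-norm least squares solution. Selecting components by the size of their coefficient, rather than by the size of their eigenvalue as PCR does, is what distinguishes the thresholding step here.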

Journal: The Annals of Statistics
Publication Date: Feb 1, 2022
Citations: 2

Similar Papers

Comparison of Least Squares and Principal Components Regression Analysis
Zeynep Küçükakçali ... Harika Gözükara Bağ
The Journal of Cognitive Systems | VOL. 7
03 Jul 2022

Quantum Algorithms for Solving Linear Regression Equation
Kai Li ... Feng Jing
Journal of Physics: Conference Series | VOL. 1738
01 Jan 2020

Methods for analyzing data from probabilistic linkage strategies based on partially identifying variables
M H P Hof ... A H Zwinderman
Statistics in Medicine | VOL. 31
16 Jul 2012

Random Projections for Large-Scale Regression
Gian-Andrea Thanei ... Christina Heinze
01 Jan 2017