Conditional predictive inference post model selection

Hannes Leeb

doi:10.1214/08-aos660

Abstract

We give a finite-sample analysis of predictive inference procedures after model selection in regression with random design. The analysis is focused on a statistically challenging scenario where the number of potentially important explanatory variables can be infinite, where no regularity conditions are imposed on unknown parameters, where the number of explanatory variables in a “good” model can be of the same order as sample size and where the number of candidate models can be of larger order than sample size. The performance of inference procedures is evaluated conditional on the training sample. Under weak conditions on only the number of candidate models and on their complexity, and uniformly over all data-generating processes under consideration, we show that a certain prediction interval is approximately valid and short with high probability in finite samples, in the sense that its actual coverage probability is close to the nominal one and in the sense that its length is close to the length of an infeasible interval that is constructed by actually knowing the “best” candidate model. Similar results are shown to hold for predictive inference procedures other than prediction intervals like, for example, tests of whether a future response will lie above or below a given threshold.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: The Annals of Statistics	Publication Date: Oct 1, 2009
Citations: 28	License type: implied-oa

R Discovery Prime

R Discovery Prime

Conditional predictive inference post model selection

Abstract

Talk to us

Similar Papers

More From: The Annals of Statistics

Lead the way for us

Similar Papers

Evaluation and selection of models for out-of-sample prediction when the sample size is small relative to the complexity of the data-generating process
Hannes Leeb
Bernoulli | VOL. 14
Hannes LeebHannes Leeb
01 Aug 2008
Bernoulli | VOL. 14

Heteroscedasticity‐robustCpmodel averaging
Qingfeng Liu ... Ryo Okui
The Econometrics Journal | VOL. 16
Qingfeng Liu, et. al.Qingfeng Liu ... Ryo Okui
01 Oct 2013
The Econometrics Journal | VOL. 16

Generalized Cp Model Averaging for Heteroskedastic Models
Qingfeng Liu ... Ryo Okui
SSRN Electronic Journal | VOL. 16
Qingfeng Liu, et. al.Qingfeng Liu ... Ryo Okui
23 Sep 2011
SSRN Electronic Journal | VOL. 16

Prediction Intervals of Response Variables based on Quantiles in High Dimensional Regression Analyses
Septian Rahardiantoro ... Anang Kurnia
IOP Conference Series: Earth and Environmental Science | VOL. 187
Septian Rahardiantoro, et. al.Septian Rahardiantoro ... Anang Kurnia
01 Nov 2018
IOP Conference Series: Earth and Environmental Science | VOL. 187

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Conditional predictive inference post model selection

Abstract

Talk to us

Similar Papers

More From: The Annals of Statistics