Null Distribution of the Test Statistic for Model Selection via Marginal Screening: Implications for Multivariate Regression Analysis

A V Rubanovich,V A Saenko

doi:10.34257/gjsfrgvol21is1pg23

Abstract

Marginal screening (MS) is the computationally simple and commonly used for the dimension reduction procedures. In it, a linear model is constructed for several top predictors, chosen according to the absolute value of marginal correlations with the dependent variable. Importantly, when kpredictors out of mprimary covariates are selected, the standard regression analysis may yield false-positive results if m>> k(Freedman's paradox). In this work, we provide analytical expressions describing null distribution of the test statistics for model selection via MS. Using the theory of order statistics, we show that under MS, the common F-statistic is distributed as a mean of ktop variables out of mindependent random variables having a 21χdistribution. Based on this finding, we estimated critical p-values for multiple regression models after MS, comparisons with which of those obtained in real studies will help researchers to avoid false-positive result. Analytical solutions obtained in the work are implemented in a free Excel spreadsheet program.

Highlights

Marginal screening (MS) is the simplest and most commonly used method of variable selection (Hastie, Tibshirani, 2003; Genovese et al, 2009, 2012; Leek, 2012)
A typical situation is when the number of objects is several orders of magnitude less than the number of covariates from which a statistically significant combination of predictors is derived. This is another side of an old problem of multiple comparisons, Author α: Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia. e-mail: rubanovich@vigg.ru Author σ: Department of Radiation Molecular Epidemiology, Atomic Bomb Disease Institute, Nagasaki University, Nagasaki, Japan
The purpose of this work was to explicitly address null-distribution of the F-statistic, which is used for testing the significance of a model (1), when model selection is performed with MS

Summary

Introduction

Marginal screening (MS) is the simplest and most commonly used method of variable selection (Hastie, Tibshirani, 2003; Genovese et al, 2009, 2012; Leek, 2012). MS is intuitively preferred by the researchers, when the number of objects (e.g. participants of a study, samples or outcomes) is much smaller than the number of explanatory variables (the so-called “n

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Null Distribution of the Test Statistic for Model Selection via Marginal Screening: Implications for Multivariate Regression Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Global Journal of Science Frontier Research

Lead the way for us

Journal: Global Journal of Science Frontier Research	Publication Date: Oct 21, 2021
License type: CC BY 4.0

Similar Papers

Regression and residual analysis in linear models with interval censored data

-

14 Dec 2004
14 Dec 2004

The robust beauty of improper linear models in decision making.
Robyn M Dawes
American Psychologist | VOL. 34
Robyn M DawesRobyn M Dawes
01 Jan 1979
American Psychologist | VOL. 34

Harmonic tonal detectors based on the BOGA
Lu Wang ... Guoan Bi
Signal Processing | VOL. 106
Lu Wang, et. al.Lu Wang ... Guoan Bi
11 Aug 2014
Signal Processing | VOL. 106

An Upside-Down Bathtub-Shaped Failure Rate Model Using a DUS Transformation of Lomax Distribution
K.S Deepthi ... V.M Chacko
-
K.S Deepthi, et. al.K.S Deepthi ... V.M Chacko
29 Jul 2020
29 Jul 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Null Distribution of the Test Statistic for Model Selection via Marginal Screening: Implications for Multivariate Regression Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Global Journal of Science Frontier Research