A post-screening diagnostic study for ultrahigh dimensional data

Yaowu Zhang,Yeqing Zhou,Liping Zhu

doi:10.1016/j.jeconom.2022.09.005

Abstract

We propose a consistent lack-of-fit test to assess whether replacing the original ultrahigh dimensional covariates with a given number of linear combinations results in a loss of regression information. To attenuate the spurious correlations that may inflate type-I error rates in high dimensions, we suggest to randomly split the observations into two parts. In the first part, we screen out as many irrelevant covariates as possible. This screening step helps to reduce the ultrahigh dimensionality to a moderate scale. In the second part, we perform a lack-of-fit test for conditional independence in the context of sufficient dimension reduction. In case that some important covariates are missed with a non-ignorable probability in the first screening stage, we introduce a multiple splitting procedure. We further propose a new statistic to test for conditional independence, which is shown to be n-consistent under the null and root-n-consistent under the alternative. We develop a consistent bootstrap procedure to approximate the asymptotic null distribution. The performances of our proposal are evaluated through comprehensive simulations and an empirical analysis of GDP data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A post-screening diagnostic study for ultrahigh dimensional data

Abstract

Talk to us

Similar Papers

More From: Journal of Econometrics

Lead the way for us

Journal: Journal of Econometrics	Publication Date: Nov 21, 2022
Citations: 2

Similar Papers

Test for conditional independence with application to conditional screening
Yeqing Zhou ... Liping Zhu
Journal of Multivariate Analysis | VOL. 175
Yeqing Zhou, et. al.Yeqing Zhou ... Liping Zhu
09 Oct 2019
Journal of Multivariate Analysis | VOL. 175

Causal Discovery Using Weight-Based Conditional Independence Test
Zhaolong Ling ... Yuee Huang
ACM Transactions on Knowledge Discovery from Data | VOL. -
Zhaolong Ling, et. al.Zhaolong Ling ... Yuee Huang
28 Aug 2024
ACM Transactions on Knowledge Discovery from Data | VOL. -

Conditional Independence Test Based on Residual Similarity
Hao Zhang ... Jihong Guan
ACM Transactions on Knowledge Discovery from Data | VOL. 17
Hao Zhang, et. al.Hao Zhang ... Jihong Guan
28 Jun 2023
ACM Transactions on Knowledge Discovery from Data | VOL. 17

Estimating and Controlling the False Discovery Rate of the PC Algorithm Using Edge-specific P-Values
Eric V Strobl ... Peter L Spirtes
ACM Transactions on Intelligent Systems and Technology | VOL. 10
Eric V Strobl, et. al.Eric V Strobl ... Peter L Spirtes
30 Sep 2019
ACM Transactions on Intelligent Systems and Technology | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A post-screening diagnostic study for ultrahigh dimensional data

Abstract

Talk to us

Similar Papers

More From: Journal of Econometrics