Rank-transformed subsampling: inference for multiple data splitting and exchangeable p-values

F Richard Guo,Rajen D Shah

doi:10.1093/jrsssb/qkae091

Abstract

Abstract Many testing problems are readily amenable to randomized tests, such as those employing data splitting. However, despite their usefulness in principle, randomized tests have obvious drawbacks. Firstly, two analyses of the same dataset may lead to different results. Secondly, the test typically loses power because it does not fully utilize the entire sample. As a remedy to these drawbacks, we study how to combine the test statistics or p-values resulting from multiple random realizations, such as through random data splits. We develop rank-transformed subsampling as a general method for delivering large-sample inference about the combined statistic or p-value under mild assumptions. We apply our methodology to a wide range of problems, including testing unimodality in high-dimensional data, testing goodness-of-fit of parametric quantile regression models, testing no direct effect in a sequentially randomized trial and calibrating cross-fit double machine learning confidence intervals. In contrast to existing p-value aggregation schemes that can be highly conservative, our method enjoys Type I error control that asymptotically approaches the nominal level. Moreover, compared to using the ordinary subsampling, we show that our rank transform can remove the first-order bias in approximating the null under alternatives and greatly improve power.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of the Royal Statistical Society Series B: Statistical Methodology	Publication Date: Sep 18, 2024
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Rank-transformed subsampling: inference for multiple data splitting and exchangeable p-values

Abstract

Talk to us

Similar Papers

More From: Journal of the Royal Statistical Society Series B: Statistical Methodology

Lead the way for us

Similar Papers

تقديرات النماذج شبه المعلمية بالتطبيق على بيانات العشارى
فاطمة علي محمد عبد العاطي ... منى محمد أحمد بصل
المجلة المصرية للدراسات التجارية | VOL. 44
فاطمة علي محمد عبد العاطي, et. al.فاطمة علي محمد عبد العاطي ... منى محمد أحمد بصل
01 Aug 2021
المجلة المصرية للدراسات التجارية | VOL. 44

Flexible Parametric Accelerated Hazard Model: Simulation and Application to Censored Lifetime Data with Crossing Survival Curves
Abdisalam Hassan Muse ... Oscar Ngesa
Mathematical and Computational Applications | VOL. 27
Abdisalam Hassan Muse, et. al.Abdisalam Hassan Muse ... Oscar Ngesa
30 Nov 2022
Mathematical and Computational Applications | VOL. 27

An overview on parametric quantile regression models and their computational implementation with applications to biomedical problems including COVID-19 data
Josmar Mazucheli ... Víctor Leiva
Computer Methods and Programs in Biomedicine | VOL. 221
Josmar Mazucheli, et. al.Josmar Mazucheli ... Víctor Leiva
25 Apr 2022
Computer Methods and Programs in Biomedicine | VOL. 221

Using the Bayesian method to estimate and comparison the regression of the exponential and gamma survival (Simulation)
Ahmed Salam Mezher ... Wadhah S Ibrahim
Journal of the College of Basic Education | VOL. 30
Ahmed Salam Mezher, et. al. Ahmed Salam Mezher ... Wadhah S Ibrahim
23 Jun 2024
Journal of the College of Basic Education | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Rank-transformed subsampling: inference for multiple data splitting and exchangeable p-values

Abstract

Talk to us

Similar Papers

More From: Journal of the Royal Statistical Society Series B: Statistical Methodology