ON BLOCKWISE AND REFERENCE PANEL-BASED ESTIMATORS FOR GENETIC DATA PREDICTION IN HIGH DIMENSIONS.

Bingxin Zhao, Shurong Zheng, Hongtu Zhu

doi:10.1214/24-aos2378

Abstract

Genetic prediction holds immense promise for translating genetic discoveries into medical advances. As the high-dimensional covariance matrix (or the linkage disequilibrium (LD) pattern) of genetic variants often presents a block-diagonal structure, numerous methods account for the dependence among variants in predetermined local LD blocks. Moreover, due to privacy considerations and data protection concerns, genetic variant dependence in each LD block is typically estimated from external reference panels rather than the original training data set. This paper presents a unified analysis of blockwise and reference panel-based estimators in a high-dimensional prediction framework without sparsity restrictions. We find that, surprisingly, even when the covariance matrix has a block-diagonal structure with well-defined boundaries, blockwise estimation methods adjusting for local dependence can be substantially less accurate than methods controlling for the whole covariance matrix. Further, estimation methods built on the original training data set and on external reference panels are likely to have varying performance in high dimensions, which may reflect the cost of having access only to summary-level data from the training data set. This analysis is based on novel results in random matrix theory for block-diagonal covariance matrices. We numerically evaluate our results using extensive simulations and real data analysis in the UK Biobank.
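To make the contrast in the abstract concrete, the following is a minimal sketch, not the paper's own estimators or notation. It assumes a ridge-type form for both estimators, and the function names (`full_ridge`, `blockwise_reference_ridge`), the penalty `lam`, the equicorrelated block design, and all sample sizes are hypothetical choices made for illustration. The first function adjusts for the whole training-sample covariance matrix; the second uses only marginal summary statistics from the training data and adjusts for dependence within each predetermined block, with block covariances estimated from a separate reference panel.

```python
import numpy as np


def full_ridge(X, y, lam):
    """Ridge-type estimator that adjusts for the whole sample covariance
    matrix computed from the training data (illustrative form only)."""
    n, p = X.shape
    z = X.T @ y / n              # marginal summary statistics
    S = X.T @ X / n              # full p x p training-sample covariance
    return np.linalg.solve(S + lam * np.eye(p), z)


def blockwise_reference_ridge(z, W, blocks, lam):
    """Ridge-type estimator that adjusts only for dependence inside each
    predetermined block, with each block covariance estimated from a
    reference panel W instead of the training data (illustrative form only)."""
    n_w = W.shape[0]
    beta = np.zeros_like(z)
    for idx in blocks:
        S_b = W[:, idx].T @ W[:, idx] / n_w   # reference-panel block covariance
        beta[idx] = np.linalg.solve(S_b + lam * np.eye(len(idx)), z[idx])
    return beta


# Hypothetical toy setting: 4 blocks of 10 variants, correlation 0.5 within blocks.
rng = np.random.default_rng(0)
n, n_w, p, block_size = 200, 150, 40, 10
blocks = [np.arange(s, s + block_size) for s in range(0, p, block_size)]

Sigma = np.zeros((p, p))
for idx in blocks:
    Sigma[np.ix_(idx, idx)] = 0.5
np.fill_diagonal(Sigma, 1.0)
L = np.linalg.cholesky(Sigma)

X = rng.standard_normal((n, p)) @ L.T      # training genotypes (simulated)
W = rng.standard_normal((n_w, p)) @ L.T    # independent reference panel (simulated)
beta_true = rng.standard_normal(p) / np.sqrt(p)
y = X @ beta_true + rng.standard_normal(n)

z = X.T @ y / n                            # the only training-data input the blockwise estimator sees
b_full = full_ridge(X, y, lam=1.0)
b_block = blockwise_reference_ridge(z, W, blocks, lam=1.0)
```

In this sketch the two estimators coincide when the reference panel is the training data itself and a single block covers all variants, which is a convenient sanity check. Any difference in accuracy on a toy example like this should not be read as reproducing the paper's high-dimensional results.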

Journal: Annals of Statistics
Publication Date: Jun 1, 2024
Citations: 1
