A tree-based gene-environment interaction analysis with rare features.

Mengque Liu, Shuangge Ma, Qingzhao Zhang

doi:10.1002/sam.11578

Abstract

Gene-environment (G-E) interaction analysis plays a critical role in understanding and modeling complex diseases. Compared to main-effect-only analysis, it is more seriously challenged by higher dimensionality, weaker signals, and the unique "main effects, interactions" variable selection hierarchy. In joint G-E interaction analysis, under which a large number of G factors are analyzed in a single model, effort tailored to rare features (e.g., SNPs with low minor allele frequencies) has been limited. Existing investigations of rare features have mostly focused on marginal analysis, where various data aggregation techniques have been developed and hypothesis tests have been conducted to identify significant aggregated features. However, such techniques cannot be extended to joint G-E interaction analysis. In this study, building on a very recent tree-based data aggregation technique, which has been developed for main-effect-only analysis, we develop a new G-E interaction analysis approach tailored to rare features. The adopted data aggregation technique allows for more efficient information borrowing from neighboring rare features. Similar to some existing state-of-the-art approaches, the proposed approach adopts penalization for variable selection and regularized estimation, and respects the variable selection hierarchy. Simulation shows that it identifies important interactions and main effects more accurately than several competing alternatives. In the analysis of the NFBC1966 study, the proposed approach leads to findings that differ from those of the alternatives, with satisfactory prediction and stability performance.
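
The abstract does not spell out the algorithm, so the following is only a minimal illustrative sketch of two ideas it names: aggregating neighboring rare features, and the "main effects, interactions" hierarchy. It is not the authors' method. The function names (`aggregate_by_groups`, `ge_design`, `respects_hierarchy`), the fixed grouping of features, and the reading of the hierarchy as "an interaction may be nonzero only if its G main effect is nonzero" are all assumptions made for illustration. In the paper the grouping would come from a tree and the selection from penalized estimation, neither of which is reproduced here.

```python
import numpy as np

def aggregate_by_groups(G, groups):
    """Sum the columns of G that share a group.

    Assumption: each group lists rare features that sit under one
    tree node, so their counts are pooled into a single feature.
    """
    return np.column_stack([G[:, idx].sum(axis=1) for idx in groups])

def ge_design(E, G):
    """Build a joint design matrix: E main effects, G main effects,
    then every E_k * G_j product (the G-E interaction terms)."""
    inter = np.column_stack([E[:, k] * G[:, j]
                             for j in range(G.shape[1])
                             for k in range(E.shape[1])])
    return np.hstack([E, G, inter])

def respects_hierarchy(beta_g, beta_inter):
    """Return True if every G factor with a nonzero interaction
    also has a nonzero main effect.

    beta_inter has shape (number of G factors, number of E factors).
    """
    beta_g = np.asarray(beta_g)
    beta_inter = np.asarray(beta_inter)
    has_inter = np.any(beta_inter != 0, axis=1)
    return bool(np.all(beta_g[has_inter] != 0))

# Toy data: 5 subjects, 4 rare binary G features, 1 E factor.
G = np.array([[1, 0, 0, 0],
              [0, 1, 0, 0],
              [0, 0, 0, 1],
              [0, 0, 1, 0],
              [0, 0, 0, 0]])
E = np.array([[0.5], [1.0], [-1.0], [2.0], [0.0]])

# Hypothetical grouping: features 0-1 share one node, 2-3 another.
groups = [[0, 1], [2, 3]]
G_agg = aggregate_by_groups(G, groups)
X = ge_design(E, G_agg)
```

On this toy input, each original feature is nonzero for only one subject, while each aggregated feature is nonzero for two, which is the sense in which pooling borrows information across neighboring rare features.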

Journal: Statistical analysis and data mining
Publication Date: Mar 1, 2022
Citations: 3

Similar Papers

Gene-environment interaction analysis via deep learning.
Shuni Wu ... Shuangge Ma
Genetic Epidemiology | VOL. 47 | 19 Feb 2023

Pathological imaging-assisted cancer gene-environment interaction analysis.
Kuangnan Fang ... Qingzhao Zhang
Biometrics | VOL. 79 | 03 May 2023

Identification of gene-environment interactions with marginal penalization.
Sanguo Zhang ... Yuan Xue
Genetic Epidemiology | VOL. 44 | 14 Nov 2019

Hierarchical false discovery rate control for high-dimensional survival analysis with interactions
Weijuan Liang ... Shuangge Ma
Computational statistics & data analysis | VOL. 192 | 05 Dec 2023
