Abstract

Polygenic risk scores (PRSs) are rapidly emerging as a way to measure disease risk by aggregating multiple genetic variants. Understanding the interplay of the PRS with environmental factors is critical for interpreting and applying PRSs in a wide variety of settings. We develop an efficient method for simultaneously modeling gene-environment correlations and interactions using the PRS in case-control studies. We use a logistic-normal regression modeling framework to specify the disease risk and PRS distribution in the underlying population and propose joint inference across the 2 models using the retrospective likelihood of the case-control data. Extensive simulation studies demonstrate the flexibility of the method in trading-off bias and efficiency for the estimation of various model parameters compared with standard logistic regression or a case-only analysis for gene-environment interactions, or a control-only analysis, for gene-environment correlations. Finally, using simulated case-control data sets within the UK Biobank study, we demonstrate the power of our method for its ability to recover results from the full prospective cohort for the detection of an interaction between long-term oral contraceptive use and the PRS on the risk of breast cancer. This method is computationally efficient and implemented in a user-friendly R package.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.