Abstract

Over one thousand genome-wide association studies (GWAS) have been conducted in the past decade. Increasing biological evidence suggests the polygenic genetic architecture of complex traits: a complex trait is affected by many risk variants with small or moderate effects jointly. Meanwhile, recent progress in GWAS suggests that complex human traits may share common genetic bases, which is known as “pleiotropy”. To further improve statistical power of detecting risk genetic variants in GWAS, we propose a penalized regression method to analyze the GWAS dataset of primary interest by incorporating information from other related GWAS. The proposed method does not require the individual-level of genotype and phenotype data from other related GWAS, making it useful when only summary statistics are available. The key idea of the proposed approach is that related traits may share common genetic basis. Specifically, we propose a linear model for integrative analysis of multiple GWAS, in which risk genetic variants can be detected via identification of nonzero coefficients. Due to the pleiotropy effect, there exist genetic variants affecting multiple traits, which correspond to a consistent nonzero pattern of coefficients across multiple GWAS. To achieve this, we use a group Lasso penalty to identify this nonzero pattern in our model, and then develop an efficient algorithm based on the proximal gradient method. Simulation studies showed that the proposed approach had satisfactory performance. We applied the proposed method to analyze a body mass index (BMI) GWAS dataset from a European American (EA) population and achieved improvement over single GWAS analysis.

Highlights

  • Genome-wide association studies (GWAS) provide an unprecedented opportunity for identifying disease-associated genetic variants

  • We propose a penalized proposed approach is that genetically related traits can share common genetic bases [18,20], which enables us to borrow information from some related GWAS when analyzing the trait of primary interest

  • GWAS suffer from low statistical power due to the individual weak effects of genetic variants

Read more

Summary

Introduction

Genome-wide association studies (GWAS) provide an unprecedented opportunity for identifying disease-associated genetic variants. A systematic investigation of pleiotropy [18] suggests that 16.9% of genes and 4.6% of SNPs have been reported to statistical power in GWAS data analysis by integrating multiple from two aspects. We propose a penalized proposed approach is that genetically related traits can share common genetic bases [18,20], which enables us to borrow information from some related GWAS when analyzing the trait of primary interest. For single GWAS analysis, many existing statistical methods have been proposed [9,10]. Due to limited sample size of a single GWAS and polygenicity of a complex trait, many existing methods do not have enough power to uncover the. GWAS together and use a group-Lasso penalty to integrate information study and real data analysis, we showed that the proposed method had advantage over single-GWAS analysis

Material and Method
Findings
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call