Abstract

Genome-wide association studies have facilitated the discovery of thousands of loci for hundreds of phenotypes. However, the issue of missing heritability remains unsolved for most complex traits. Locus discovery could be enhanced with both improved power through multi-phenotype analysis (MPA) and use of a wider allele frequency range, including rare variants (RVs). MPA methods for single-variant association have been proposed, but given their low power for RVs, more efficient approaches are required. We propose multi-phenotype analysis of rare variants (MARV), a burden test-based method for RVs extended to the joint analysis of multiple phenotypes through a powerful reverse regression technique. Specifically, MARV models the proportion of RVs at which minor alleles are carried by individuals within a genomic region as a linear combination of multiple phenotypes, which can be both binary and continuous, and the method accommodates directly the genotyped and imputed data. The full model, including all phenotypes, is tested for association for discovery, and a more thorough dissection of the phenotype combinations for any set of RVs is also enabled. We show, via simulations, that the type I error rate is well controlled under various correlations between two continuous phenotypes, and that the method outperforms a univariate burden test in all considered scenarios. Application of MARV to 4876 individuals from the Northern Finland Birth Cohort 1966 for triglycerides, high- and low-density lipoprotein cholesterols highlights known loci with stronger signals of association than those observed in univariate RV analyses and suggests novel RV effects for these lipid traits.

Highlights

  • In the past decade, thousands of loci for hundreds of phenotypes have been identified through genome-wide association studies (GWAS)

  • We propose multi-phenotype analysis of rare variants (MARV), a burden test-based method for RVs extended to the joint analysis of multiple phenotypes through a powerful reverse regression technique

  • We suggest integrating multi-phenotype analysis (MPA) with RV association analysis to further increase the power of locus discovery and to provide novel biological insights into complex trait genetics

Read more

Summary

Introduction

Thousands of loci for hundreds of phenotypes have been identified through genome-wide association studies (GWAS). A number of methods for multi-phenotype analysis (MPA) have been developed, including dimension reduction, and multivariate and graphical models.[7] In genetics, MPA was first proposed for linkage studies,[2,3,8,9] and recently, a plethora of methods for single-variant MPA of association has been suggested.[10] MPA has been shown to provide a boost in power for locus discovery,[2,3,4,5] as well as increased precision of parameter estimates.[9] There is a biological advantage in analysing correlated traits jointly, since detecting loci that affect a combination of phenotypes could provide suggestions of pleiotropic effects,[7] that is, one locus affecting multiple phenotypes in parallel. MPA methods will be useful for the omics data in order to reduce dimensionality and avoid the penalties posed by correction for multiple testing

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call