Abstract

We propose a multivariate genome-wide association test for mixed continuous, binary, and ordinal phenotypes. A latent response model is used to estimate the correlation between phenotypes with different measurement scales so that the empirical distribution of the Fisher’s combination statistic under the null hypothesis is estimated efficiently. The simulation study shows that our proposed correlation estimation methods have high levels of accuracy. More importantly, our approach conservatively estimates the variance of the test statistic so that the type I error rate is controlled. The simulation also shows that the proposed test maintains the power at the level very close to that of the ideal analysis based on known latent phenotypes while controlling the type I error. In contrast, conventional approaches–dichotomizing all observed phenotypes or treating them as continuous variables–could either reduce the power or employ a linear regression model unfit for the data. Furthermore, the statistical analysis on the database of the Study of Addiction: Genetics and Environment (SAGE) demonstrates that conducting a multivariate test on multiple phenotypes can increase the power of identifying markers that may not be, otherwise, chosen using marginal tests. The proposed method also offers a new approach to analyzing the Fagerström Test for Nicotine Dependence as multivariate phenotypes in genome-wide association studies.

Highlights

  • Since the first genome-wide association study (GWAS) [1], more than 2,000 loci have been identified to be significantly associated with one or more complex traits [2]

  • We extend the Fisher combination function method originally designed for continuous multivariate phenotypes [6] to handle mixed continuous, binary, or ordinal multivariate phenotypes

  • We conducted a simulation study to evaluate the accuracy of different correlation estimation methods for mixed continuous, binary, and ordinal data described in the previous section

Read more

Summary

Introduction

Since the first genome-wide association study (GWAS) [1], more than 2,000 loci have been identified to be significantly associated with one or more complex traits [2]. Researchers have focused on genes associated with well defined functions or specific traits. A systematic review showed that many loci are associated with multiple traits [3]. This motivates researchers to study pleiotropy which is a condition in which a single gene affects multiple traits. A well-known example published on Nature is a GWAS involving 107 phenotypes that identified multiple pleiotropy genes [4]. From a statistical point of view, for complex

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.