Abstract

BackgroundA multivariate genome-wide association test is proposed for analyzing data on multivariate quantitative phenotypes collected from related subjects. The proposed method is a two-step approach. The first step models the association between the genotype and marginal phenotype using a linear mixed model. The second step uses the correlation between residuals of the linear mixed model to estimate the null distribution of the Fisher combination test statistic.ResultsThe simulation results show that the proposed method controls the type I error rate and is more powerful than the marginal tests across different population structures (admixed or non-admixed) and relatedness (related or independent). The statistical analysis on the database of the Study of Addiction: Genetics and Environment (SAGE) demonstrates that applying the multivariate association test may facilitate identification of the pleiotropic genes contributing to the risk for alcohol dependence commonly expressed by four correlated phenotypes.ConclusionsThis study proposes a multivariate method for identifying pleiotropic genes while adjusting for cryptic relatedness and population structure between subjects. The two-step approach is not only powerful but also computationally efficient even when the number of subjects and the number of phenotypes are both very large.

Highlights

  • A multivariate genome-wide association test is proposed for analyzing data on multivariate quantitative phenotypes collected from related subjects

  • Discussion the simulation shows that adding principal components as covariates to the linear mixed model did not substantially improve the accuracy of estimating the correlation between phenotypes, it can adjust for potential population structure and cryptic relatedness in genome-wide association study (GWAS) as well as improve the estimation of marginal genetic effects [15]

  • In this study, we propose a new multivariate method for GWAS when multivariate quantitative phenotypes are used to indicate the risk for a complex disease and the data are collected from related subjects

Read more

Summary

Introduction

A multivariate genome-wide association test is proposed for analyzing data on multivariate quantitative phenotypes collected from related subjects. The first step models the association between the genotype and marginal phenotype using a linear mixed model. The second step uses the correlation between residuals of the linear mixed model to estimate the null distribution of the Fisher combination test statistic. Unlike linkage analysis that requires data collection from genetically related subjects, GWAS is applicable to a more general setting involving independent subjects. If existing methods designed for independent subjects are adopted to analyze these data, the power of association tests will be greatly reduced because only a subset of data can be used. Yu et al (2006) [9] proposed a compromise between

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call