Abstract
BackgroundA multivariate genome-wide association test is proposed for analyzing data on multivariate quantitative phenotypes collected from related subjects. The proposed method is a two-step approach. The first step models the association between the genotype and marginal phenotype using a linear mixed model. The second step uses the correlation between residuals of the linear mixed model to estimate the null distribution of the Fisher combination test statistic.ResultsThe simulation results show that the proposed method controls the type I error rate and is more powerful than the marginal tests across different population structures (admixed or non-admixed) and relatedness (related or independent). The statistical analysis on the database of the Study of Addiction: Genetics and Environment (SAGE) demonstrates that applying the multivariate association test may facilitate identification of the pleiotropic genes contributing to the risk for alcohol dependence commonly expressed by four correlated phenotypes.ConclusionsThis study proposes a multivariate method for identifying pleiotropic genes while adjusting for cryptic relatedness and population structure between subjects. The two-step approach is not only powerful but also computationally efficient even when the number of subjects and the number of phenotypes are both very large.
Highlights
A multivariate genome-wide association test is proposed for analyzing data on multivariate quantitative phenotypes collected from related subjects
Discussion the simulation shows that adding principal components as covariates to the linear mixed model did not substantially improve the accuracy of estimating the correlation between phenotypes, it can adjust for potential population structure and cryptic relatedness in genome-wide association study (GWAS) as well as improve the estimation of marginal genetic effects [15]
In this study, we propose a new multivariate method for GWAS when multivariate quantitative phenotypes are used to indicate the risk for a complex disease and the data are collected from related subjects
Summary
A multivariate genome-wide association test is proposed for analyzing data on multivariate quantitative phenotypes collected from related subjects. The first step models the association between the genotype and marginal phenotype using a linear mixed model. The second step uses the correlation between residuals of the linear mixed model to estimate the null distribution of the Fisher combination test statistic. Unlike linkage analysis that requires data collection from genetically related subjects, GWAS is applicable to a more general setting involving independent subjects. If existing methods designed for independent subjects are adopted to analyze these data, the power of association tests will be greatly reduced because only a subset of data can be used. Yu et al (2006) [9] proposed a compromise between
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.