Abstract
Identifying genetic variants associated with complex diseases is an important task in genetic research. Although association studies based on unrelated individuals (ie, case-control genome-wide association studies) have successfully identified common single-nucleotide polymorphisms for many complex diseases, these studies are not so likely to identify rare genetic variants. In contrast, family-based association studies are particularly useful for identifying rare-variant associations. Recently, there has been some interest in employing multilevel models in family-based genetic association studies. However, the performance of such models in these studies, especially for longitudinal family-based sequence data, has not been fully investigated. Therefore, in this study, we investigated the performance of the multilevel model in the family-based genetic association analysis and compared it with the conventional family-based association test, by examining the powers and type I error rates of the 2 approaches using 3 data sets from the Genetic Analysis Workshop 18 simulated data: genome-wide association single-nucleotide polymorphism data, sequence data, and rare-variants-only data. Compared with the univariate family-based association test, the multilevel model had slightly higher power to identify most of the causal genetic variants using the genome-wide association single-nucleotide polymorphism data and sequence data. However, both approaches had low power to identify most of the causal single-nucleotide polymorphisms, especially those among the relatively rare genetic variants. Therefore, we suggest a unified method that combines both approaches and incorporates collapsing strategy, which may be more powerful than either approach alone for studying genetic associations using family-based data.
Highlights
Identifying genetic variants associated with complex diseases is an important task in genetic studies, including genome-wide association (GWA) studies and whole-genome sequencing studies
We investigated the powers and type I error probabilities of both approaches using simulated GWA, sequence, and rare-variants-only data provided by Genetic Analysis Workshop 18 (GAW18), with knowledge of the simulation model
When using SBP as the phenotype of interest, we found that of the 246 noncausal SNPs, SNPs in the multilevel model analysis, and SNPs in the family-based association test (FBAT) approach had inflated type I error rates
Summary
Identifying genetic variants associated with complex diseases is an important task in genetic studies, including genome-wide association (GWA) studies and whole-genome sequencing studies. There has been some interest in employing multilevel models in family-based genetic association studies [2,4]. The performance of multilevel model analysis in family-based genetic association studies, especially for longitudinal family-based sequence data, has not been fully investigated. In this study, our aim was to examine the performance of multilevel model analysis in family-based association study, compared with that of the more commonly used family-based association test (FBAT) [5,6]. We investigated the powers and type I error probabilities of both approaches using simulated GWA, sequence, and rare-variants-only data provided by Genetic Analysis Workshop 18 (GAW18), with knowledge of the simulation model
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.