Abstract
The discovery and prioritization of heritable phenotypes is a computational challenge in a variety of settings, including neuroimaging genetics and analyses of the vast phenotypic repositories in electronic health record systems and population-based biobanks. Classical estimates of heritability require twin or pedigree data, which can be costly and difficult to acquire. Genome-wide complex trait analysis is an alternative tool to compute heritability estimates from unrelated individuals, using genome-wide data that are increasingly ubiquitous, but is computationally demanding and becomes difficult to apply in evaluating very large numbers of phenotypes. Here we present a fast and accurate statistical method for high-dimensional heritability analysis using genome-wide SNP data from unrelated individuals, termed massively expedited genome-wide heritability analysis (MEGHA) and accompanying nonparametric sampling techniques that enable flexible inferences for arbitrary statistics of interest. MEGHA produces estimates and significance measures of heritability with several orders of magnitude less computational time than existing methods, making heritability-based prioritization of millions of phenotypes based on data from unrelated individuals tractable for the first time to our knowledge. As a demonstration of application, we conducted heritability analyses on global and local morphometric measurements derived from brain structural MRI scans, using genome-wide SNP data from 1,320 unrelated young healthy adults of non-Hispanic European ancestry. We also computed surface maps of heritability for cortical thickness measures and empirically localized cortical regions where thickness measures were significantly heritable. Our analyses demonstrate the unique capability of MEGHA for large-scale heritability-based screening and high-dimensional heritability profile construction.
Highlights
The discovery and prioritization of heritable phenotypes is a computational challenge in a variety of settings, including neuroimaging genetics and analyses of the vast phenotypic repositories in electronic health record systems and population-based biobanks
As genome-wide data became widely available, genome-wide complex trait analysis (GCTA) [5, 6] was developed, which assesses aggregate effects of common SNPs spanning the genome on phenotypes and provides an SNP-based heritability estimate, a lower bound for narrow-sense heritability
intracranial volume (ICV), total brain volume, and mean cortical thickness measures are highly heritable, with familywise error corrected (FWEc) significant P values computed by the proposed permutation procedure (Materials and Methods)
Summary
The discovery and prioritization of heritable phenotypes is a computational challenge in a variety of settings, including neuroimaging genetics and analyses of the vast phenotypic repositories in electronic health record systems and population-based biobanks. As genome-wide data became widely available, genome-wide complex trait analysis (GCTA) [5, 6] was developed, which assesses aggregate effects of common SNPs spanning the genome on phenotypes and provides an SNP-based heritability estimate, a lower bound for narrow-sense heritability. This method has been successfully applied to the heritability analyses of several complex traits and mental disorders [5, 7, 8] and has been used to investigate the puzzle of “missing heritability” [5, 9, 10]. MEGHA provides both magnitude estimates and significance measures of heritability with orders of magnitude less computational effort relative to GCTA, making it possible to analyze millions of phenotypes and develop sampling techniques that produce accurate inferences for Significance
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.