Abstract

Genome-wide association (GWA) studies have been limited by the reliance on common variants present on microarrays or imputable from the HapMap Project data. More recently, the completion of the 1000 Genomes Project has provided variant and haplotype information for several million variants derived from sequencing over 1,000 individuals. To help understand the extent to which more variants (including low frequency (1% ≤ MAF <5%) and rare variants (<1%)) can enhance previously identified associations and identify novel loci, we selected 93 quantitative circulating factors where data was available from the InCHIANTI population study. These phenotypes included cytokines, binding proteins, hormones, vitamins and ions. We selected these phenotypes because many have known strong genetic associations and are potentially important to help understand disease processes. We performed a genome-wide scan for these 93 phenotypes in InCHIANTI. We identified 21 signals and 33 signals that reached P<5×10−8 based on HapMap and 1000 Genomes imputation, respectively, and 9 and 11 that reached a stricter, likely conservative, threshold of P<5×10−11 respectively. Imputation of 1000 Genomes genotype data modestly improved the strength of known associations. Of 20 associations detected at P<5×10−8 in both analyses (17 of which represent well replicated signals in the NHGRI catalogue), six were captured by the same index SNP, five were nominally more strongly associated in 1000 Genomes imputed data and one was nominally more strongly associated in HapMap imputed data. We also detected an association between a low frequency variant and phenotype that was previously missed by HapMap based imputation approaches. An association between rs112635299 and alpha-1 globulin near the SERPINA gene represented the known association between rs28929474 (MAF = 0.007) and alpha1-antitrypsin that predisposes to emphysema (P = 2.5×10−12). Our data provide important proof of principle that 1000 Genomes imputation will detect novel, low frequency-large effect associations.

Highlights

  • Genome-wide association (GWA) studies have identified many novel associations between common genetic variants and human traits

  • Does 1000 Genomes imputation identify stronger associations at loci identified by HapMap based imputation? We identified 14 loci associated with 13 traits where both the HapMap and 1000 Genomes index SNPs were P,561028 and differed from each other (Table 4 and Table 6)

  • Our results show that imputation of genotype data from the sequenced based 1000 Genomes reference panel successfully captures many more variants than imputation of genotype data from the microarray based HapMap reference panel

Read more

Summary

Introduction

Genome-wide association (GWA) studies have identified many novel associations between common genetic variants and human traits. These studies capture a large proportion of SNP-based common variation in the human genome but are less efficient at capturing low frequency and rare variants. Imputation is useful for performing GWA metaanalyses across studies with the same traits but different genotyping arrays. Imputation can potentially identify signals of association not detected by direct genotypes. A causal variant, or the most strongly associated genetic variant, may not be directly genotyped and may exist on a haplotype that is optimally captured by two rather than one directly genotyped SNP (Figure 1)

Objectives
Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.