Abstract

Methods that can evaluate aggregate effects of rare and common variants are limited. Therefore, we applied a two-stage approach to evaluate aggregate gene effects in the 1000 Genomes Project data, which contain 24,487 single-nucleotide polymorphisms (SNPs) in 697 unrelated individuals from 7 populations. In stage 1, we identified potentially interesting genes (PIGs) as those having at least one SNP meeting Bonferroni correction using univariate, multiple regression models. In stage 2, we evaluate aggregate PIG effects on trait, Q1, by modeling each gene as a latent construct, which is defined by multiple common and rare variants, using the multivariate statistical framework of structural equation modeling (SEM). In stage 1, we found that PIGs varied markedly between a randomly selected replicate (replicate 137) and 100 other replicates, with the exception of FLT1. In stage 1, collapsing rare variants decreased false positives but increased false negatives. In stage 2, we developed a good-fitting SEM model that included all nine genes simulated to affect Q1 (FLT1, KDR, ARNT, ELAV4, FLT4, HIF1A, HIF3A, VEGFA, VEGFC) and found that FLT1 had the largest effect on Q1 (βstd = 0.33 ± 0.05). Using replicate 137 estimates as population values, we found that the mean relative bias in the parameters (loadings, paths, residuals) and their standard errors across 100 replicates was on average, less than 5%. Our latent variable SEM approach provides a viable framework for modeling aggregate effects of rare and common variants in multiple genes, but more elegant methods are needed in stage 1 to minimize type I and type II error.

Highlights

  • The 1000 Genomes Project is an international publicprivate consortium aiming to build the most detailed map of human genetic variation with the overarching goal to improve our understanding of the genetic contribution to common human diseases

  • We found that several genes had a least one single-nucleotide polymorphisms (SNPs) meeting or exceeding the Bonferroni-corrected level with (p ≤ 8.33 × 10−6) and without (p ≤ 2.04 × 10−6) collapsing rare variants (MAF < 0.05) (Table 1), but the most significant associations were observed with common (C13S522, C13S523) and rare (C1S3524) variants in FLT1 (Table 2)

  • In stage 2, when building the FLT1 construct using replicate 137, we found that adding rare variants to the common variants improved the model fit (CFI = 0.90, root mean-square error of approximation (RMSEA) = 0.03, and SRMR = 0.08 in Figure 1A vs. comparative fit index (CFI) = 0.96, RMSEA = 0.02, and SRMR = 0.05 in Figure 1B), improved construct reliability (Cronbach’s a: 0.40 (A) vs. 0.53 (B)), and increased the variance explained in Q1 (R2: 0.30 ± 0.04 (A) vs. 0.36 ± 0.04 (B))

Read more

Summary

Introduction

The 1000 Genomes Project is an international publicprivate consortium aiming to build the most detailed map of human genetic variation with the overarching goal to improve our understanding of the genetic contribution to common human diseases. Strategies have been developed to evaluate the contribution of rare variants to disease susceptibility in nonfamilial data, including collapsing methods, which are reviewed by Dering et al [2], approaches that evaluate the combined or aggregate effects of rare and common variants together are limited. In this paper we aim to evaluate the aggregate effects of rare and common single-nucleotide polymorphisms (SNPs) in genes on the simulated quantitative trait Q1 using the Pilot Project 3 data (unrelated subjects). In stage 1 we use multiple regression methods (with and without collapsing rare variants) to identify potentially interesting genes (PIGs); in stage 2, we use a latent variable structural equation modeling (SEM) approach to evaluate aggregate effects of rare and common variants in PIGs on Q1. We used knowledge that 39 SNPs in 9 genes, primarily in the vascular endothelial growth factor (VEGF) pathway, were simulated to be associated with Q1

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call