Abstract

BackgroundThe Genetic Analysis Workshops (GAW) are a forum for development, testing, and comparison of statistical genetic methods and software. Each contribution to the workshop includes an application to a specified data set. Here we describe the data distributed for GAW19, which focused on analysis of human genomic and transcriptomic data.MethodsGAW19 data were donated by the T2D-GENES Consortium and the San Antonio Family Heart Study and included whole genome and exome sequences for odd-numbered autosomes, measures of gene expression, systolic and diastolic blood pressures, and related covariates in two Mexican American samples. These two samples were a collection of 20 large families with whole genome sequence and transcriptomic data and a set of 1943 unrelated individuals with exome sequence. For each sample, simulated phenotypes were constructed based on the real sequence data. ‘Functional’ genes and variants for the simulations were chosen based on observed correlations between gene expression and blood pressure. The simulations focused primarily on additive genetic models but also included a genotype-by-medication interaction. A total of 245 genes were designated as ‘functional’ in the simulations with a few genes of large effect and most genes explaining < 1 % of the trait variation. An additional phenotype, Q1, was simulated to be correlated among related individuals, based on theoretical or empirical kinship matrices, but was not associated with any sequence variants. Two hundred replicates of the phenotypes were simulated. The GAW19 data are an expansion of the data used at GAW18, which included the family-based whole genome sequence, blood pressure, and simulated phenotypes, but not the gene expression data or the set of 1943 unrelated individuals with exome sequence.

Highlights

  • MethodsGenetic Analysis Workshop 19 (GAW19) data were donated by the T2D-GENES Consortium and the San Antonio Family Heart Study and included whole genome and exome sequences for odd-numbered autosomes, measures of gene expression, systolic and diastolic blood pressures, and related covariates in two Mexican American samples

  • The Genetic Analysis Workshops (GAW) are a forum for development, testing, and comparison of statistical genetic methods and software

  • Data distributed for the workshop included whole genome sequence (WGS) and gene expression in 959 individuals in a set of 20 Mexican American families [1] and whole exome sequence in a set of 1943 unrelated Mexican American individuals drawn from a larger multi-ethnic case/ control study [2]

Read more

Summary

Methods

Whole genome sequence in families The family data set distributed for GAW19 is an expanded version of that used for GAW18, which has been previously described in detail [1]. In the exome data set of unrelated individuals, simulated SBP and DBP were modeled for a single time point, with the same mean, variance, age, sex, and medication effects as in the family sample. In addition to the measured genetic effects generated from the sequence data, an aggregate, unspecified additive genetic residual was modeled based on pedigree-derived kinship estimates in the family data set and on empirical kinship estimates among all pairs of individuals estimated using the program LDAK [13] in the exome data set This residual additive genetic correlation was set to maintain the heritabilities of simulated SBP and DBP and the genetic correlations between them as observed in exam 1 of the family data set. A single time point was simulated for 200 replicates of Q1 in both the family and exome data sets

Background
Findings
Conclusions
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.