Inflated type I error rates when using aggregation methods to analyze rare variants in the 1000 Genomes Project exon sequencing data in unrelated individuals: summary results from Group 7 at Genetic Analysis Workshop 17.

Nathan Tintle,Haitian Wang,Hugues Aschard,Nora Nock,Elizabeth Pugh,Inchi Hu

doi:10.1002/gepi.20650

Abstract

As part of Genetic Analysis Workshop 17 (GAW17), our group considered the application of novel and standard approaches to the analysis of genotype-phenotype association in next-generation sequencing data. Our group identified a major issue in the analysis of the GAW17 next-generation sequencing data: type I error and false-positive report probability rates higher than those expected based on empirical type I error levels (as high as 90%). Two main causes emerged: population stratification and long-range correlation (gametic phase disequilibrium) between rare variants. Population stratification was expected because of the diverse sample. Correlation between rare variants was attributable to both random causes (e.g., nearly 10,000 of 25,000 markers were private variants, and the sample size was small [n = 697]) and nonrandom causes (more correlation was observed than was expected by random chance). Principal components analysis was used to control for population structure and helped to minimize type I errors, but this was at the expense of identifying fewer causal variants. A novel multiple regression approach showed promise to handle correlation between markers. Further work is needed, first, to identify best practices for the control of type I errors in the analysis of sequencing data and then to explore and compare the many promising new aggregating approaches for identifying markers associated with disease phenotypes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Inflated type I error rates when using aggregation methods to analyze rare variants in the 1000 Genomes Project exon sequencing data in unrelated individuals: summary results from Group 7 at Genetic Analysis Workshop 17.

Abstract

Talk to us

Similar Papers

More From: Genetic epidemiology

Lead the way for us

Journal: Genetic epidemiology	Publication Date: Jan 1, 2011
Citations: 29

Similar Papers

Extending Rare-Variant Testing Strategies: Analysis of Noncoding Sequence and Imputed Genotypes
Matthew Zawistowski ... Sebastian Zöllner
The American Journal of Human Genetics | VOL. 87
Matthew Zawistowski, et. al.Matthew Zawistowski ... Sebastian Zöllner
01 Nov 2010
The American Journal of Human Genetics | VOL. 87

Sequence Kernel Association Tests for the Combined Effect of Rare and Common Variants
Iuliana Ionita-Laza ... Xihong Lin
The American Journal of Human Genetics | VOL. 92
Iuliana Ionita-Laza, et. al.Iuliana Ionita-Laza ... Xihong Lin
16 May 2013
The American Journal of Human Genetics | VOL. 92

SNVer: a statistical tool for variant calling in analysis of pooled or individual next-generation sequencing data
Zhi Wei ... Pingzhao Hu
Nucleic Acids Research | VOL. 39
Zhi Wei, et. al.Zhi Wei ... Pingzhao Hu
03 Aug 2011
Nucleic Acids Research | VOL. 39

A general statistic to test an optimally weighted combination of common and/or rare variants.
Jianjun Zhang ... Shuanglin Zhang
Genetic Epidemiology | VOL. 43
Jianjun Zhang, et. al.Jianjun Zhang ... Shuanglin Zhang
09 Sep 2019
Genetic Epidemiology | VOL. 43

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Inflated type I error rates when using aggregation methods to analyze rare variants in the 1000 Genomes Project exon sequencing data in unrelated individuals: summary results from Group 7 at Genetic Analysis Workshop 17.

Abstract

Talk to us

Similar Papers

More From: Genetic epidemiology