Abstract

Dense SNP maps can be highly informative for linkage studies. But when parental genotypes are missing, multipoint linkage scores can be inflated in regions with substantial marker-marker linkage disequilibrium (LD). Such regions were observed in the Affymetrix SNP genotypes for the Genetic Analysis Workshop 14 (GAW14) Collaborative Study on the Genetics of Alcoholism (COGA) dataset, providing an opportunity to test a novel simulation strategy for studying this problem. First, an inheritance vector (with or without linkage present) is simulated for each replicate, i.e., locations of recombinations and transmission of parental chromosomes are determined for each meiosis. Then, two sets of founder haplotypes are superimposed onto the inheritance vector: one set that is inferred from the actual data and which contains the pattern of LD; and one set created by randomly selecting parental alleles based on the known allele frequencies, with no correlation (LD) between markers. Applying this strategy to a map of 176 SNPs (66 Mb of chromosome 7) for 100 replicates of 116 sibling pairs, significant inflation of multipoint linkage scores was observed in regions of high LD when parental genotypes were set to missing, with no linkage present. Similar inflation was observed in analyses of the COGA data for these affected sib pairs with parental genotypes set to missing, but not after reducing the marker map until r2 between any pair of markers was ≤ 0.05. Additional simulation studies of affected sib pairs assuming uniform LD throughout a marker map demonstrated inflation of significance levels at r2 values greater than 0.05. When genotypes are available only from two affected siblings in many families in a sample, trimming SNP maps to limit r2 to 0–0.05 for all marker pairs will prevent inflation of linkage scores without sacrificing substantial linkage information. Simulation studies on the observed pedigree structures and map can also be used to determine the effect of LD on a particular study.

Highlights

  • Linkage genome scans using dense maps of single nucleotide polymorphism (SNP) markers have been shown to provide greater information content than 10-cM microsatellite scans [1,2,3,4,5]

  • False positive peaks were observed in a SNP-based linkage study of prostate cancer in regions with marker-marker D' values greater than 0.6 [4]; and in simulations of pairs of markers with no linkage present, inflation of linkage scores was observed as marker-marker D' values increased between 0.4 and 0.8 [6]

  • These analyses support the conclusion that when parental genotypes are missing and cannot be reconstructed from the constellation of genotyped individuals, the presence of marker-marker linkage disequilibrium (LD) can substantially inflate linkage scores [4,6]

Read more

Summary

Introduction

Linkage genome scans using dense maps of single nucleotide polymorphism (SNP) markers have been shown to provide greater information content than 10-cM microsatellite scans [1,2,3,4,5]. False positive peaks were observed in a SNP-based linkage study of prostate cancer in regions with marker-marker D' values greater than 0.6 [4]; and in simulations of pairs of markers with no linkage present, inflation of linkage scores was observed as marker-marker D' values increased between 0.4 and 0.8 [6]. The Collaborative Study on the Genetics of Alcoholism (COGA) datasets for Genetic Analysis Workshop 14 (GAW14) provided an opportunity to study this problem,. LwFiinigtkhuaragend1scwoirtheos ufotrp1ar1e6nCtaOl gGenAoAtySpPess: High vs low LD maps Linkage scores for 116 COGA ASPs: High vs low LD maps with and without parental genotypes. The main goal of the present analyses was to test a novel simulation strategy for studying the effect of LD on linkage scores with and without availability of parental genotypes

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call