Linkage disequilibrium mapping via cladistic analysis of phase-unknown genotypes and inferred haplotypes in the Genetic Analysis Workshop 14 simulated data

Caroline Durrant,Andrew P Morris

doi:10.1186/1471-2156-6-s1-s100

Caroline Durrant, Andrew P Morris

Open Access

https://doi.org/10.1186/1471-2156-6-s1-s100

Copy DOI

Abstract

We recently described a method for linkage disequilibrium (LD) mapping, using cladistic analysis of phased single-nucleotide polymorphism (SNP) haplotypes in a logistic regression framework. However, haplotypes are often not available and cannot be deduced with certainty from the unphased genotypes. One possible two-stage approach is to infer the phase of multilocus genotype data and analyze the resulting haplotypes as if known. Here, haplotypes are inferred using the expectation-maximization (EM) algorithm and the best-guess phase assignment for each individual analyzed. However, inferring haplotypes from phase-unknown data is prone to error and this should be taken into account in the subsequent analysis. An alternative approach is to analyze the phase-unknown multilocus genotypes themselves. Here we present a generalization of the method for phase-known haplotype data to the case of unphased SNP genotypes. Our approach is designed for high-density SNP data, so we opted to analyze the simulated dataset. The marker spacing in the initial screen was too large for our method to be effective, so we used the answers provided to request further data in regions around the disease loci and in null regions. Power to detect the disease loci, accuracy in localizing the true site of the locus, and false-positive error rates are reported for the inferred-haplotype and unphased genotype methods. For this data, analyzing inferred haplotypes outperforms analysis of genotypes. As expected, our results suggest that when there is little or no LD between a disease locus and the flanking region, there will be no chance of detecting it unless the disease variant itself is genotyped.

Highlights

Disease-marker association studies of samples of unrelated cases and controls have been shown to have the potential to map all but extremely rare variants contributing to complex traits [1]
We recently described a method [2] for linkage disequilibrium (LD) mapping, using cladistic analysis of single-nucleotide polymorphism (SNP) haplotypes in a logistic regression framework, which allows straightforward incorporation of covariates
We propose in this paper a generalization of our cladistic analysis method for haplotypes to analyze unphased genotypes directly

Summary

Introduction

Disease-marker association studies of samples of unrelated cases and controls have been shown to have the potential to map all but extremely rare variants contributing to complex traits [1]. A number of statistical approaches have been developed to infer haplotypes and their relative frequencies in a sample and to assign phase to the multilocus genotypes. It is common to employ a two-stage approach of inferring phase and analyzing the 'best' haplotype configuration as if it were known with certainty. The disadvantage of this approach is that we cannot take account of the uncertainty in the phase assignment process. To overcome this problem, we propose in this paper a generalization of our cladistic analysis method for haplotypes to analyze unphased genotypes directly. We use the Genetic Analysis Workshop 14 (GAW14) simulated dataset to compare the analysis of unphased and inferred haplotype analysis in terms of power and accuracy to locate disease loci

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC genetics	Publication Date: Dec 1, 2005
Citations: 16	License type: cc-by

R Discovery Prime

R Discovery Prime

Linkage disequilibrium mapping via cladistic analysis of phase-unknown genotypes and inferred haplotypes in the Genetic Analysis Workshop 14 simulated data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC genetics

Lead the way for us

Similar Papers

A Flexible Bayesian Framework for Modeling Haplotype Association with Disease, Allowing for Dominance Effects of the Underlying Causative Variants
Andrew P Morris
American Journal of Human Genetics | VOL. 79
Andrew P MorrisAndrew P Morris
01 Oct 2006
American Journal of Human Genetics | VOL. 79

Extent and Distribution of Linkage Disequilibrium in Three Genomic Regions
Gonçalo R Abecasis ... William O.C Cookson
American Journal of Human Genetics | VOL. 68
Gonçalo R Abecasis, et. al.Gonçalo R Abecasis ... William O.C Cookson
01 Jan 2001
American Journal of Human Genetics | VOL. 68

Autism-Associated Haplotype Affects the Regulation of the Homeobox Gene, ENGRAILED 2
Rym Benayed ... James H Millonig
Biological psychiatry | VOL. 66
Rym Benayed, et. al.Rym Benayed ... James H Millonig
17 Jul 2009
Biological psychiatry | VOL. 66

Comparative LD mapping using single SNPs and haplotypes identifies QTL for plant height and biomass as secondary traits of drought tolerance in maize
Yanli Lu ... Zhuanfang Hao
Molecular breeding : new strategies in plant improvement | VOL. 30
Yanli Lu, et. al.Yanli Lu ... Zhuanfang Hao
22 Sep 2011
Molecular breeding : new strategies in plant improvement | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Linkage disequilibrium mapping via cladistic analysis of phase-unknown genotypes and inferred haplotypes in the Genetic Analysis Workshop 14 simulated data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC genetics