HaploPool: improving haplotype frequency estimation through DNA pools and phylogenetic modeling

Bonnie Kirkpatrick, Eran Halperin, Carlos Santos Armendariz, Richard M Karp

doi:10.1093/bioinformatics/btm435

Abstract

The search for genetic variants that are linked to complex diseases such as cancer, Parkinson's, or Alzheimer's disease may lead to better treatments. Since haplotypes can serve as proxies for hidden variants, one method of finding the linked variants is to look for case-control associations between the haplotypes and disease. Finding these associations requires a high-quality estimation of the haplotype frequencies in the population. To this end, we present HaploPool, a method of estimating haplotype frequencies from blocks of consecutive SNPs. HaploPool leverages the efficiency of DNA pools and estimates the population haplotype frequencies from pools of disjoint sets, each containing two or three unrelated individuals. We study the trade-off between pooling efficiency and accuracy of haplotype frequency estimates. For a fixed genotyping budget, HaploPool performs favorably on pools of two individuals as compared with a state-of-the-art non-pooled phasing method, PHASE. Of independent interest, HaploPool can be used to phase non-pooled genotype data with an accuracy approaching that of PHASE. We compared our algorithm to three programs that estimate haplotype frequencies from pooled data. HaploPool is an order of magnitude more efficient (at least six times faster), and considerably more accurate than previous methods. In contrast to previous methods, HaploPool performs well with missing data, genotyping errors and long haplotype blocks (of between 5 and 25 SNPs).
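To make the estimation problem concrete, the following is a minimal sketch of a generic expectation-maximization (EM) estimator for haplotype frequencies from pooled allele counts. It is not the HaploPool algorithm, which the abstract describes as also using phylogenetic modeling. It only illustrates the kind of input (per-SNP allele counts for each pool) and output (population haplotype frequencies) involved. The function name `estimate_pool_frequencies`, its parameters, and the assumption of error-free counts with no missing data are all hypothetical choices made for this illustration.

```python
from collections import Counter
from itertools import combinations_with_replacement, product
from math import factorial


def estimate_pool_frequencies(pools, n_snps, haps_per_pool=4, n_iter=50):
    """Generic EM sketch (not HaploPool itself; names are hypothetical).

    pools[i][j] is the number of copies of allele 1 observed at SNP j among
    the haplotypes in pool i. A pool of two diploid individuals holds four
    haplotypes, hence the default haps_per_pool=4.
    """
    haplotypes = list(product((0, 1), repeat=n_snps))
    # Start from uniform haplotype frequencies.
    freq = {h: 1.0 / len(haplotypes) for h in haplotypes}

    # Group every multiset of haplotypes by the allele counts it would produce.
    configs = {}
    for combo in combinations_with_replacement(haplotypes, haps_per_pool):
        counts = tuple(sum(h[j] for h in combo) for j in range(n_snps))
        configs.setdefault(counts, []).append(Counter(combo))

    n_total = haps_per_pool * len(pools)
    for _ in range(n_iter):
        expected = dict.fromkeys(haplotypes, 0.0)
        for obs in pools:
            candidates = configs[tuple(obs)]
            # E-step: multinomial probability of each compatible multiset.
            weights = []
            for multiset in candidates:
                w = float(factorial(haps_per_pool))
                for h, k in multiset.items():
                    w *= freq[h] ** k / factorial(k)
                weights.append(w)
            total = sum(weights)
            for multiset, w in zip(candidates, weights):
                for h, k in multiset.items():
                    expected[h] += k * w / total
        # M-step: normalize expected haplotype counts into frequencies.
        freq = {h: expected[h] / n_total for h in haplotypes}
    return freq
```

Because the sketch enumerates every multiset of haplotypes, its cost grows very quickly with the number of SNPs and the pool size, so it is only practical for very short blocks. That is the kind of scaling limit a method aimed at blocks of 5 to 25 SNPs has to avoid.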

Journal: Bioinformatics	Publication Date: Sep 25, 2007
Citations: 37
