Haplotype reconstruction using perfect phylogeny and sequence data

Anatoly Efros,Eran Halperin

doi:10.1186/1471-2105-13-s6-s3

Anatoly Efros, Eran Halperin

Open Access

https://doi.org/10.1186/1471-2105-13-s6-s3

Copy DOI

Abstract

Haplotype phasing is a well studied problem in the context of genotype data. With the recent developments in high-throughput sequencing, new algorithms are needed for haplotype phasing, when the number of samples sequenced is low and when the sequencing coverage is blow. High-throughput sequencing technologies enables new possibilities for the inference of haplotypes. Since each read is originated from a single chromosome, all the variant sites it covers must derive from the same haplotype. Moreover, the sequencing process yields much higher SNP density than previous methods, resulting in a higher correlation between neighboring SNPs. We offer a new approach for haplotype phasing, which leverages on these two properties. Our suggested algorithm, called Perfect Phlogeny Haplotypes from Sequencing (PPHS) uses a perfect phylogeny model and it models the sequencing errors explicitly. We evaluated our method on real and simulated data, and we demonstrate that the algorithm outperforms previous methods when the sequencing error rate is high or when coverage is low.

Highlights

The etiology of complex diseases is composed of both environmental and genetic factors
These studies have been focusing on measurements of single nucleotide polymorphisms (SNPs), which are positions in the genome in which at some point in history there has been a mutation that was fixed in the population
Our algorithm aims at finding a perfect phylogeny tree on the set of SNPs in a given window, and a corresponding haplotype assignment for each individual

Summary

Introduction

The etiology of complex diseases is composed of both environmental and genetic factors. Much of this effort has been focused on genome-wide association studies (GWAS), in which the DNA of a population of cases (individuals carrying the studied condition), and a population of controls (general population) is being measured and compared. These studies have been focusing on measurements of single nucleotide polymorphisms (SNPs), which are positions in the genome in which at some point in history there has been a mutation that was fixed in the population.

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Apr 19, 2012
Citations: 19	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Haplotype reconstruction using perfect phylogeny and sequence data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Author response: A method for low-coverage single-gamete sequence analysis demonstrates adherence to Mendel’s first law across a large sample of human sperm
Sara A Carioscia ... Avery Davis Bell
-
Sara A Carioscia, et. al.Sara A Carioscia ... Avery Davis Bell
05 May 2022
05 May 2022

Decision letter: A method for low-coverage single-gamete sequence analysis demonstrates adherence to Mendel’s first law across a large sample of human sperm
Molly Przeworski
-
Molly PrzeworskiMolly Przeworski
19 Apr 2022
19 Apr 2022

Editor's evaluation: A method for low-coverage single-gamete sequence analysis demonstrates adherence to Mendel’s first law across a large sample of human sperm
Daniel R Matute
-
Daniel R MatuteDaniel R Matute
19 Apr 2022
19 Apr 2022

On the design and analysis of next-generation sequencing genotyping for a cohort with haplotype-informative reads
Degui Zhi ... Kui Zhang
Methods | VOL. 79-80
Degui Zhi, et. al.Degui Zhi ... Kui Zhang
30 Jan 2015
Methods | VOL. 79-80

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Haplotype reconstruction using perfect phylogeny and sequence data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics