An accurate algorithm for the detection of DNA fragments from dilution pool sequencing experiments.

Vikas Bansal

doi:10.1093/bioinformatics/btx436

Abstract

The short read lengths of current high-throughput sequencing technologies limit the ability to recover long-range haplotype information. Dilution pool methods for preparing DNA sequencing libraries from high molecular weight DNA fragments enable the recovery of long DNA fragments from short sequence reads. These approaches require computational methods for identifying the DNA fragments using aligned sequence reads and assembling the fragments into long haplotypes. Although a number of computational methods have been developed for haplotype assembly, the problem of identifying DNA fragments from dilution pool sequence data has not received much attention. We formulate the problem of detecting DNA fragments from dilution pool sequencing experiments as a genome segmentation problem and develop an algorithm that uses dynamic programming to optimize a likelihood function derived from a generative model for the sequence reads. This algorithm uses an iterative approach to automatically infer the mean background read depth and the number of fragments in each pool. Using simulated data, we demonstrate that our method, FragmentCut, has 25-30% greater sensitivity than an HMM-based method for fragment detection and can also detect overlapping fragments. On a whole-genome human fosmid pool dataset, the haplotypes assembled using the fragments identified by FragmentCut had greater N50 length, 16.2% lower switch error rate and 35.8% lower mismatch error rate compared with two existing methods. We further demonstrate the greater accuracy of our method using two additional dilution pool datasets. FragmentCut is available from https://bansal-lab.github.io/software/FragmentCut. Contact: vibansal@ucsd.edu. Supplementary data are available at Bioinformatics online.
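The segmentation idea in the abstract can be illustrated with a small sketch. This is not the FragmentCut implementation or its generative model: it assumes read counts in fixed-width genomic bins, a Poisson count model with known background and fragment rates, and a fixed per-fragment penalty. The function names and the parameters `bg_rate`, `frag_rate`, `penalty`, `min_bins` and `max_bins` are hypothetical. Iterative estimation of the background depth and detection of overlapping fragments, both described in the abstract, are not attempted here.

```python
import math


def poisson_loglik(k, lam):
    """Log-probability of observing count k under a Poisson with mean lam."""
    return k * math.log(lam) - lam - math.lgamma(k + 1)


def segment_fragments(counts, bg_rate, frag_rate, penalty,
                      min_bins=2, max_bins=50):
    """Return half-open bin intervals (start, end) labelled as fragments.

    Illustrative dynamic program: best[j] is the best penalised
    log-likelihood ratio (versus all-background) for the first j bins.
    """
    n = len(counts)
    # Per-bin gain of calling the bin "fragment" rather than "background".
    gain = [poisson_loglik(c, frag_rate) - poisson_loglik(c, bg_rate)
            for c in counts]
    prefix = [0.0]
    for g in gain:
        prefix.append(prefix[-1] + g)

    best = [0.0] * (n + 1)
    back = [None] * (n + 1)  # start bin of a fragment ending at j, if any
    for j in range(1, n + 1):
        best[j] = best[j - 1]  # option 1: bin j-1 is background
        # Option 2: a fragment covering bins i..j-1, within length limits.
        for i in range(max(0, j - max_bins), j - min_bins + 1):
            score = best[i] + prefix[j] - prefix[i] - penalty
            if score > best[j]:
                best[j] = score
                back[j] = i

    # Trace back from the last bin to recover the chosen fragments.
    fragments = []
    j = n
    while j > 0:
        if back[j] is None:
            j -= 1
        else:
            fragments.append((back[j], j))
            j = back[j]
    fragments.reverse()
    return fragments


# Toy example: one run of high-depth bins on a sparse background.
counts = [0, 0, 1, 0, 5, 6, 4, 7, 5, 0, 0, 1, 0, 0]
print(segment_fragments(counts, bg_rate=0.2, frag_rate=5.0, penalty=5.0))
```

The penalty discourages splitting one fragment into several and calling fragments from isolated background reads. In a real pipeline, the rates would be estimated from the data rather than fixed in advance.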

Similar Papers

Association of high molecular weight DNA fragmentation with apoptotic or non-apoptotic cell death induced by calcium ionophore
Akihiro Kataoka ... Kenshi Furusho
FEBS Letters | VOL. 364
15 May 1995

G-SNPM - A GPU-based SNP mapping tool
Alessandro Orro ... Andrea Manconi
EMBnet.journal | VOL. 18
09 Nov 2012

Microindel detection in short-read sequence data
Peter Krawitz ... Marten Jäger
Bioinformatics | VOL. 26
09 Feb 2010

The CrmA- and TPCK-sensitive pathways that trigger oligonucleosome-sized DNA fragmentation in camptothecin-induced apoptosis: relation to caspase activation and high molecular weight DNA fragmentation
A.T Sané ... R Bertrand
Biochemistry and Cell Biology | VOL. 75
01 Aug 1997
