Efficient alignment of pyrosequencing reads for re-sequencing applications

Francisco Fernandes, Paulo GS da Fonseca, Ana T Freitas, Arlindo L Oliveira, Luis MS Russo

doi:10.1186/1471-2105-12-163

Abstract

Background: Over the past few years, new massively parallel DNA sequencing technologies have emerged. These platforms generate massive amounts of data per run, greatly reducing the cost of DNA sequencing. However, they also raise important computational difficulties, mostly because of the huge volume of data produced, but also because of specific characteristics such as read length and sequencing errors. Among the most critical problems is that of efficiently and accurately mapping reads to a reference genome in the context of re-sequencing projects.

Results: We present an efficient method for the local alignment of pyrosequencing reads produced by the GS FLX (454) system against a reference sequence. Our approach exploits the characteristics of the data in these re-sequencing applications and uses state-of-the-art indexing techniques combined with a flexible seed-based approach, leading to a fast and accurate algorithm that needs very little user parameterization. An evaluation performed using real and simulated data shows that our proposed method outperforms a number of mainstream tools on the quantity and quality of successful alignments, as well as on execution time.

Conclusions: The proposed methodology was implemented in a software tool called TAPyR (Tool for the Alignment of Pyrosequencing Reads), which is publicly available from http://www.tapyr.net.
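To make the idea of a seed-based approach over an index concrete, the following is a minimal sketch of generic seed-and-extend read mapping. It is not TAPyR's algorithm: it assumes a plain hash-table k-mer index in place of whatever index the tool actually uses, and it verifies candidates with an ungapped mismatch count, whereas a practical aligner for pyrosequencing reads would need gapped extension. The names `build_kmer_index` and `map_read`, the parameters `k` and `max_mismatches`, and the toy sequences are all hypothetical.

```python
from collections import Counter, defaultdict


def build_kmer_index(reference, k):
    """Map every k-mer in the reference to the list of positions where it starts."""
    index = defaultdict(list)
    for pos in range(len(reference) - k + 1):
        index[reference[pos:pos + k]].append(pos)
    return index


def map_read(read, reference, index, k, max_mismatches=2):
    """Return (reference_start, mismatches) for the best ungapped placement, or None."""
    # Seeding: each non-overlapping k-mer of the read votes for a diagonal,
    # i.e. the reference position where the read would have to start.
    votes = Counter()
    for offset in range(0, len(read) - k + 1, k):
        for hit in index.get(read[offset:offset + k], ()):
            votes[hit - offset] += 1
    if not votes:
        return None

    # Extension: verify the most supported diagonal by counting mismatches.
    start, _ = votes.most_common(1)[0]
    if start < 0 or start + len(read) > len(reference):
        return None
    window = reference[start:start + len(read)]
    mismatches = sum(a != b for a, b in zip(read, window))
    return (start, mismatches) if mismatches <= max_mismatches else None


# Toy data (hypothetical), chosen only to exercise the two code paths.
reference = "ACGTTGCATGCCGATAGCTAGGCTTACGATCGA"
index = build_kmer_index(reference, k=4)
exact_hit = map_read("GCCGATAGCTAG", reference, index, k=4)
noisy_hit = map_read("GCCGATAGGTAG", reference, index, k=4)  # one substitution
```

In this sketch a read with one substituted base still maps, because the remaining seeds agree on the same diagonal and the verification step tolerates a small number of mismatches.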

Highlights

Over the past few years, new massively parallel DNA sequencing technologies have emerged.
We evaluated TAPyR against other mainstream mapping tools that can handle high-throughput pyrosequencing reads, namely BWA-SW [11], SSAHA2 [13], Segemehl [10], GASSST [12], and Newbler [14].
We wanted to analyze the ability of the algorithms to produce high-coverage mappings, which directly relates to the proportion of reads that can be successfully mapped.


Journal: BMC Bioinformatics
Publication Date: May 16, 2011
Citations: 31
License type: CC BY
