Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory

Mark J Chaisson,Glenn Tesler

doi:10.1186/1471-2105-13-238

Abstract

BackgroundRecent methods have been developed to perform high-throughput sequencing of DNA by Single Molecule Sequencing (SMS). While Next-Generation sequencing methods may produce reads up to several hundred bases long, SMS sequencing produces reads up to tens of kilobases long. Existing alignment methods are either too inefficient for high-throughput datasets, or not sensitive enough to align SMS reads, which have a higher error rate than Next-Generation sequencing.ResultsWe describe the method BLASR (Basic Local Alignment with Successive Refinement) for mapping Single Molecule Sequencing (SMS) reads that are thousands of bases long, with divergence between the read and genome dominated by insertion and deletion error. The method is benchmarked using both simulated reads and reads from a bacterial sequencing project. We also present a combinatorial model of sequencing error that motivates why our approach is effective.ConclusionsThe results indicate that it is possible to map SMS reads with high accuracy and speed. Furthermore, the inferences made on the mapability of SMS reads using our combinatorial model of sequencing error are in agreement with the mapping accuracy demonstrated on simulated reads.

Highlights

Recent methods have been developed to perform high-throughput sequencing of DNA by Single Molecule Sequencing (SMS)
Basic Local Alignment via Successive Refinement (BLASR), which maps reads using coarse alignment methods developed during whole genome alignment (WGA) studies, while speeding up these methods by using the advanced data structures employed in many next generation sequencing (NGS) mapping studies
We present a practical comparison of alignment methods on PacBioRS sequences

Summary

Introduction

Recent methods have been developed to perform high-throughput sequencing of DNA by Single Molecule Sequencing (SMS). Existing alignment methods are either too inefficient for high-throughput datasets, or not sensitive enough to align SMS reads, which have a higher error rate than Next-Generation sequencing. Reads produced by Sanger sequencing that are highly accurate and nearly 1000 bases long are successfully mapped using hash-based methods such as MEGABLAST [2], cross match (Green P., www.phrap.org, unpublished), and BLAT [3]. These methods are too inefficient to map read sets from generation sequencing (NGS) instruments by Illumina (San Diego, CA, USA)

Objectives

Methods

Results

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC bioinformatics	Publication Date: Sep 19, 2012
Citations: 1066	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC bioinformatics

Lead the way for us

Similar Papers

MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads.
Chuan-Le Xiao ... Zhi Xie
Nature Methods | VOL. 14
Chuan-Le Xiao, et. al.Chuan-Le Xiao ... Zhi Xie
18 Sep 2017
Nature Methods | VOL. 14

Sequencing Paired Reads using True Single Molecule Sequencing (tSMS)™ Technology
Jeff G Reifenberger ... Patrice Milos
Biophysical journal | VOL. 96
Jeff G Reifenberger, et. al.Jeff G Reifenberger ... Patrice Milos
01 Feb 2009
Biophysical journal | VOL. 96

Longshot enables accurate variant calling in diploid genomes from single-molecule long read sequencing
Peter Edge ... Vikas Bansal
Nature Communications | VOL. 10
Peter Edge, et. al.Peter Edge ... Vikas Bansal
11 Oct 2019
Nature Communications | VOL. 10

Lra: A long read aligner for sequences and contigs
Ferhat Ay ... Jian Ma
-
Ferhat Ay, et. al.Ferhat Ay ... Jian Ma
21 Jun 2021
21 Jun 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC bioinformatics