High speed BLASTN: an accelerated MegaBLAST search tool.

Ying Chen,Yuesheng Xu,Yongdong Zhang,Weicai Ye

doi:10.1093/nar/gkv784

Ying Chen, Yuesheng Xu + Show 2 more

Open Access

https://doi.org/10.1093/nar/gkv784

Copy DOI

Abstract

Sequence alignment is a long standing problem in bioinformatics. The Basic Local Alignment Search Tool (BLAST) is one of the most popular and fundamental alignment tools. The explosive growth of biological sequences calls for speedup of sequence alignment tools such as BLAST. To this end, we develop high speed BLASTN (HS-BLASTN), a parallel and fast nucleotide database search tool that accelerates MegaBLAST—the default module of NCBI-BLASTN. HS-BLASTN builds a new lookup table using the FMD-index of the database and employs an accurate and effective seeding method to find short stretches of identities (called seeds) between the query and the database. HS-BLASTN produces the same alignment results as MegaBLAST and its computational speed is much faster than MegaBLAST. Specifically, our experiments conducted on a 12-core server show that HS-BLASTN can be 22 times faster than MegaBLAST and exhibits better parallel performance than MegaBLAST. HS-BLASTN is written in C++ and the related source code is available at https://github.com/chenying2016/queries under the GPLv3 license.

Highlights

Identifying sequences having statistically significant local alignments with a given query is routine in computational biology
We compare the performance of HSBLASTN with that of MegaBLAST on each query set under different numbers of CPU threads
T M(q, n) T H(q, n) as the relative speedup achieved by HS-BLASTN in comparison to MegaBLAST when both alignment tools running on query set q under n CPU threads

Summary

Introduction

Identifying sequences (in a target database) having statistically significant local alignments with a given query is routine in computational biology. BLAST builds a lookup table for the query, and scans the database for seeds, which are heuristic points for significant local alignments. These seeds are extended to longer ungapped alignments and to gapped alignments. Searching homologous sequences in a target database is a bottleneck in bioinformatics due to the exponential growth in the number of biological sequences [3]. Many methods were proposed to address this issue. They can be divided into two categories: hardware acceleration and improved indexing

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nucleic Acids Research	Publication Date: Aug 6, 2015
Citations: 384	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

High speed BLASTN: an accelerated MegaBLAST search tool.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Nucleic Acids Research

Lead the way for us

Similar Papers

Blast-i2b2: Blast for Biological Sequence Comparison in i2b2 Platform
Alaa Alarfaj ... Mohy Uddin
Journal of Computer Science & Systems Biology | VOL. 10
Alaa Alarfaj, et. al.Alaa Alarfaj ... Mohy Uddin
01 Jan 2017
Blast-i2b2: Blast for Biological Sequence Comparison in i2b2 Platform
Alaa Alarfaj ... Mohy Uddin

Targeting the Human Cancer Pathway Protein Interaction Network by Structural Genomics
Yuanpeng Janet Huang ... Gaetano T Montelione
Molecular & Cellular Proteomics | VOL. 7
Yuanpeng Janet Huang, et. al.Yuanpeng Janet Huang ... Gaetano T Montelione
01 Oct 2008
Molecular & Cellular Proteomics | VOL. 7

English
S Donkor Eric ... K Adiku Theophilus
Journal of Bioinformatics and Sequence Analysis | VOL. 6
S Donkor Eric, et. al.S Donkor Eric ... K Adiku Theophilus
30 Apr 2014
Journal of Bioinformatics and Sequence Analysis | VOL. 6

UniProt Tools: BLAST, Align, Peptide Search, and ID Mapping.
Rossana Zaru ... Sandra Orchard
Current Protocols | VOL. 3
Rossana Zaru, et. al.Rossana Zaru ... Sandra Orchard
01 Mar 2023
Current Protocols | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

High speed BLASTN: an accelerated MegaBLAST search tool.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Nucleic Acids Research