Block Aligner: an adaptive SIMD-accelerated aligner for sequences and position-specific scoring matrices.

Daniel Liu, Martin Steinegger

doi:10.1093/bioinformatics/btad487

Abstract

Efficiently aligning sequences is a fundamental problem in bioinformatics. Many recent algorithms for computing alignments through Smith-Waterman-Gotoh dynamic programming (DP) exploit Single Instruction Multiple Data (SIMD) operations on modern CPUs for speed. However, these advances have largely ignored difficulties associated with efficiently handling complex scoring matrices or large gaps (insertions or deletions). We propose a new SIMD-accelerated algorithm called Block Aligner for aligning nucleotide and protein sequences against other sequences or position-specific scoring matrices. We introduce a new paradigm that uses blocks in the DP matrix that greedily shift, grow, and shrink. This approach allows regions of the DP matrix to be adaptively computed. Our algorithm runs 5-10 times faster than some previous methods while incurring an error rate of less than 3% on protein and long-read datasets, despite large gaps and low sequence identities. Our algorithm is implemented for global, local, and X-drop alignments. It is available as a Rust library (with C bindings) at https://github.com/Daniel-Liu-c0deb0t/block-aligner.
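For readers unfamiliar with the Gotoh recurrence mentioned in the abstract, the following is a minimal scalar sketch of global alignment scoring with affine gap penalties. It is not Block Aligner's adaptive block algorithm and does not use the library's API; the function name `gotoh_global_score`, its parameters, and the simple match/mismatch scoring are assumptions chosen for illustration. It fills the full DP matrix row by row, which is the baseline work that SIMD and adaptive-block methods aim to reduce.

```rust
/// Hypothetical helper (not part of the block-aligner crate): global alignment
/// score with affine gaps via the Gotoh recurrence, computed over the full DP matrix.
/// `gap_open` is the score of the first position of a gap, `gap_extend` the score
/// of each additional position; both are expected to be negative.
pub fn gotoh_global_score(
    a: &[u8],
    b: &[u8],
    match_score: i32,
    mismatch_score: i32,
    gap_open: i32,
    gap_extend: i32,
) -> i32 {
    // Large negative sentinel that cannot overflow when a penalty is added once.
    const NEG_INF: i32 = i32::MIN / 2;
    let m = b.len();

    // h[j] holds H for the previous row until it is overwritten for the current row.
    let mut h = vec![0i32; m + 1];
    // f[j] holds the best score of an alignment ending in a gap that consumes a[i-1].
    let mut f = vec![NEG_INF; m + 1];

    // First row: b[..j] aligned entirely against a single gap.
    for j in 1..=m {
        h[j] = gap_open + (j as i32 - 1) * gap_extend;
    }

    for i in 1..=a.len() {
        // diag carries H[i-1][j-1] as the inner loop moves right.
        let mut diag = h[0];
        // First column: a[..i] aligned entirely against a single gap.
        h[0] = gap_open + (i as i32 - 1) * gap_extend;
        // e is the best score of an alignment ending in a gap that consumes b[j-1].
        let mut e = NEG_INF;

        for j in 1..=m {
            // Open a new gap from H[i][j-1] or extend the existing one.
            e = (h[j - 1] + gap_open).max(e + gap_extend);
            // Open a new gap from H[i-1][j] or extend the existing one.
            f[j] = (h[j] + gap_open).max(f[j] + gap_extend);

            let sub = if a[i - 1] == b[j - 1] { match_score } else { mismatch_score };
            let best = (diag + sub).max(e).max(f[j]);

            diag = h[j]; // Save H[i-1][j] before overwriting it.
            h[j] = best;
        }
    }
    h[m]
}
```

Under these assumed scores (match 1, mismatch -1, gap open -2, gap extend -1), calling `gotoh_global_score(b"ACGT", b"AGT", 1, -1, -2, -1)` aligns three matching bases around one single-base gap. The sketch takes time and memory proportional to the product and the shorter dimension of the matrix respectively; it makes no attempt to skip regions of the matrix.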

Journal: Bioinformatics	Publication Date: Aug 1, 2023
Citations: 7	License type: CC BY 4.0

Similar Papers

Fast index based algorithms and software for matching position specific scoring matrices.
Michael Beckstette ... Stefan Kurtz
BMC bioinformatics | VOL. 7
24 Aug 2006

Glossary
Fran Lewitter ... Janet M Thornton
Trends in Biotechnology | VOL. 16
01 Nov 1998

On the Complexity of Deriving Position Specific Score Matrices from Examples
Tatsuya Akutsu ... Sascha Ott
01 Jan 2002

Computational Prediction of Transcription Factor Binding Sites Based on HMM Model and Information Content
Xiaobao Su ... Lifang Liu
International Journal of Digital Content Technology and its Applications | VOL. 5
31 Oct 2011
