Efficient algorithms for regular expression constrained sequence alignment

Yun-Sheng Chung, Chin Lung Lu, Chuan Yi Tang

doi:10.1016/j.ipl.2007.04.007

Abstract

Imposing constraints is an effective means to incorporate biological knowledge into alignment procedures. As in the PROSITE database, functional sites of proteins can be effectively described as regular expressions. In an alignment of protein sequences it is natural to expect that functional motifs should be aligned together. Motivated by this, Arslan introduced the regular expression constrained sequence alignment problem and proposed an algorithm which, if implemented naïvely, can take time and space up to $O(|\Sigma|^2 |V|^4 n^2)$ and $O(|\Sigma|^2 |V|^4 n)$, respectively, where $\Sigma$ is the alphabet, $n$ is the sequence length, and $V$ is the set of states in an automaton equivalent to the input regular expression. In this paper we propose a more efficient algorithm for this problem, which takes $O(|V|^3 n^2)$ time and $O(|V|^2 n)$ space in the worst case. If $|V| = O(\log n)$, we propose another algorithm with time complexity $O(|V|^2 \log|V| \, n^2)$. The time complexity of our algorithms is independent of $\Sigma$, which is desirable in protein applications, where the formulation of this problem originates; a factor of $|\Sigma|^2 = 400$ in the time complexity of the previously proposed algorithm would significantly affect efficiency in practice.
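
As a rough illustration of the gap between these bounds, the ratios below divide the earlier algorithm's stated worst-case bounds by the new ones. The value $|\Sigma| = 20$ is an assumption (the 20 standard amino acids, consistent with the $|\Sigma|^2 = 400$ quoted above), and the ratios compare upper bounds only, not measured running times.

```latex
\begin{align*}
\text{time:}\quad
  \frac{|\Sigma|^2 |V|^4 n^2}{|V|^3 n^2}
  &= |\Sigma|^2 \, |V| = 400\,|V|
  && \text{(assuming } |\Sigma| = 20\text{)} \\[4pt]
\text{space:}\quad
  \frac{|\Sigma|^2 |V|^4 n}{|V|^2 n}
  &= |\Sigma|^2 \, |V|^2 = 400\,|V|^2 \\[4pt]
\text{time, when } |V| = O(\log n)\text{:}\quad
  \frac{|\Sigma|^2 |V|^4 n^2}{|V|^2 \log|V| \, n^2}
  &= \frac{|\Sigma|^2 \, |V|^2}{\log|V|} = \frac{400\,|V|^2}{\log|V|}
\end{align*}
```

In each case $n$ cancels, so the improvement in the bounds depends only on the alphabet size and the number of automaton states.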

Journal: Information Processing Letters
Publication Date: Apr 29, 2007
Citations: 11

Similar Papers

Efficient Algorithms for Regular Expression Constrained Sequence Alignment
Yun-Sheng Chung ... Chin Lung Lu
01 Jan 2006

An improved algorithm for the regular expression constrained multiple sequence alignment problem
Abdullah N Arslan ... Dan He
01 Oct 2006

Multiple Sequence Alignment Containing a Sequence of Regular Expressions
A.N Arslan
01 Jan 2004

Practical regular expression constrained sequence alignment
Lise Rommel Romero Navarrete ... Guilherme P Telles
Theoretical Computer Science | VOL. 815
17 Feb 2020
