Locating All Common Subsequences in Two DNA Sequences

M I Khalil

doi:10.5815/ijitcs.2016.05.09

Abstract

Biological sequence comparison is one of the most important and basic problems in computational biology. Due to its high demands for computational power and memory, it is a very challenging task. The well-known algorithm proposed by Smith-Waterman obtains the best local alignments at the expense of very high computing power and huge memory requirements. This paper introduces a new efficient algorithm to locate the longest common subsequences (LCS) in two different DNA sequences. It is based on the convolution between the two DNA sequences: The major sequence is represented in the linked-list X while the minor one is represented in circular linked-list Y. An array of linked lists is established where each linked list is corresponding to an element of the linked-list X and a new node is added to it for each match between the two sequences. If two or more matches in different locations in string Y share the same location in string X, the corresponding nodes will construct a unique linked-list. Accordingly, by the end of processing, we obtain a group of linked-lists containing nodes that reflect all possible matches between the two sequences X and Y. The proposed algorithm has been implemented and tested using C# language. The benchmark test shows very good speedups and indicated that impressive improvements has been achieved.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Locating All Common Subsequences in Two DNA Sequences

Abstract

Talk to us

Similar Papers

More From: International Journal of Information Technology and Computer Science

Lead the way for us

Journal: International Journal of Information Technology and Computer Science	Publication Date: May 8, 2016
Citations: 16

Similar Papers

Parallel Smith-Waterman Algorithm for Local DNA Comparison in a Cluster of Workstations
Azzedine Boukerche ... Alba Cristina Magalhaes Alves De Melo
-
Azzedine Boukerche, et. al.Azzedine Boukerche ... Alba Cristina Magalhaes Alves De Melo
01 Jan 2004
01 Jan 2004

Parallel strategies for the local biological sequence alignment in a cluster of workstations
Azzedine Boukerche ... Maria Emilia Machado Telles Walter
Journal of Parallel and Distributed Computing | VOL. 67
Azzedine Boukerche, et. al.Azzedine Boukerche ... Maria Emilia Machado Telles Walter
27 Dec 2006
Journal of Parallel and Distributed Computing | VOL. 67

Using a DSM application to locally align DNA sequences
R Bezerra Batista ... Li Weigang
-
R Bezerra Batista, et. al.R Bezerra Batista ... Li Weigang
19 Apr 2004
19 Apr 2004

An Improved Longest Common Subsequence Algorithm for Reducing Memory Complexity in Global Alignment of DNA Sequences
Elham Parvinnia ... Mohammad Taheri
-
Elham Parvinnia, et. al.Elham Parvinnia ... Mohammad Taheri
01 May 2008
01 May 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Locating All Common Subsequences in Two DNA Sequences

Abstract

Talk to us

Similar Papers

More From: International Journal of Information Technology and Computer Science