Searching for repeats, as an example of using the generalised Ruzzo-Tompa algorithm to find optimal subsequences with gaps.

John L Spouge,Sergey L Sheetlin,Leonardo Mariño Ramírez

doi:10.1504/ijbra.2014.062991

Abstract

Some biological sequences contain subsequences of unusual composition; e.g. some proteins contain DNA binding domains, transmembrane regions and charged regions, and some DNA sequences contain repeats. The linear-time Ruzzo-Tompa (RT) algorithm finds subsequences of unusual composition, using a sequence of scores as input and the corresponding 'maximal segments' as output. In principle, permitting gaps in the output subsequences could improve sensitivity. Here, the input of the RT algorithm is generalised to a finite, totally ordered, weighted graph, so the algorithm locates paths of maximal weight through increasing but not necessarily adjacent vertices. By permitting the penalised deletion of unfavourable letters, the generalisation therefore includes gaps. The program RepWords, which finds inexact simple repeats in DNA, exemplifies the general concepts by out-performing a similar extant, ad hoc tool. With minimal programming effort, the generalised Ruzzo-Tompa algorithm could improve the performance of many programs for finding biological subsequences of unusual composition.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Searching for repeats, as an example of using the generalised Ruzzo-Tompa algorithm to find optimal subsequences with gaps.

Abstract

Talk to us

Similar Papers

More From: International journal of bioinformatics research and applications

Lead the way for us

Journal: International journal of bioinformatics research and applications	Publication Date: Jan 1, 2014
Citations: 35

Similar Papers

The ruzzo-tompa algorithm can find the maximal paths in weighted, directed graphs on a one-dimensional lattice
John L Spouge ... Sergey L Sheetlin
-
John L Spouge, et. al.John L Spouge ... Sergey L Sheetlin
01 Feb 2012
01 Feb 2012

How p53 binds DNA as a tetramer.
K G Mclure
The EMBO Journal | VOL. 17
K G MclureK G Mclure
15 Jun 1998
The EMBO Journal | VOL. 17

Aiolos, a lymphoid restricted transcription factor that interacts with Ikaros to regulate lymphocyte differentiation.
B Morgan
The EMBO Journal | VOL. 16
B MorganB Morgan
15 Apr 1997
The EMBO Journal | VOL. 16

RF15 | PMON291 Crystal Structures of Androgen DNA Binding Domain Interacting with its Response Elements (C3(1) ARE and MMTV-GRE)
Frank Claessens ... Christine Helsen
Journal of the Endocrine Society | VOL. 6
Frank Claessens, et. al.Frank Claessens ... Christine Helsen
01 Nov 2022
Journal of the Endocrine Society | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Searching for repeats, as an example of using the generalised Ruzzo-Tompa algorithm to find optimal subsequences with gaps.

Abstract

Talk to us

Similar Papers

More From: International journal of bioinformatics research and applications