Efficient algorithms for locating the length-constrained heaviest segments with applications to biomolecular sequence analysis

Yaw-Ling Lin,Tao Jiang,Kun-Mao Chao

doi:10.1016/s0022-0000(02)00010-7

Abstract

We study two fundamental problems concerning the search for interesting regions in sequences: (i) given a sequence of real numbers of length n and an upper bound U, find a consecutive subsequence of length at most U with the maximum sum and (ii) given a sequence of real numbers of length n and a lower bound L, find a consecutive subsequence of length at least L with the maximum average. We present an O( n)-time algorithm for the first problem and an O(n log L) -time algorithm for the second. The algorithms have potential applications in several areas of biomolecular sequence analysis including locating GC-rich regions in a genomic DNA sequence, post-processing sequence alignments, annotating multiple sequence alignments, and computing length-constrained ungapped local alignment. Our preliminary tests on both simulated and real data demonstrate that the algorithms are very efficient and able to locate useful (such as GC-rich) regions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Computer and System Sciences	Publication Date: Nov 1, 2002
Citations: 119	License type: mit

R Discovery Prime

R Discovery Prime

Efficient algorithms for locating the length-constrained heaviest segments with applications to biomolecular sequence analysis

Abstract

Talk to us

Similar Papers

More From: Journal of Computer and System Sciences

Lead the way for us

Similar Papers

Efficient Algorithms for Locating the Length-Constrained Heaviest Segments, with Applications to Biomolecular Sequence Analysis
Yaw-Ling Lin ... Tao Jiang
-
Yaw-Ling Lin, et. al.Yaw-Ling Lin ... Tao Jiang
01 Jan 2002
01 Jan 2002

Glossary
Fran Lewitter ... Janet M Thornton
Trends in Biotechnology | VOL. 16
Fran Lewitter, et. al.Fran Lewitter ... Janet M Thornton
01 Nov 1998
Trends in Biotechnology | VOL. 16

Theoretical Analysis of the Stress Induced B-Z Transition in Superhelical DNA
Dina Zhabinskaya ... Craig J Benham
PLoS Computational Biology | VOL. 7
Dina Zhabinskaya, et. al.Dina Zhabinskaya ... Craig J Benham
20 Jan 2011
PLoS Computational Biology | VOL. 7

AlcoR: alignment-free simulation, mapping, and visualization of low-complexity regions in biological data.
Jorge M Silva ... Diogo Pratas
GigaScience | VOL. 12
Jorge M Silva, et. al.Jorge M Silva ... Diogo Pratas
28 Dec 2022
GigaScience | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient algorithms for locating the length-constrained heaviest segments with applications to biomolecular sequence analysis

Abstract

Talk to us

Similar Papers

More From: Journal of Computer and System Sciences