A new method for finding long consensus patterns in nucleic acid sequences

Philip Taylor,Paul Rosenberg,Mary.G Samsonova

doi:10.1093/bioinformatics/7.4.495

Philip Taylor, Paul Rosenberg + Show 1 more

https://doi.org/10.1093/bioinformatics/7.4.495

Copy DOI

Abstract

We describe a fast computer algorithm for identifying consensus patterns in DNA sequences. The method requires no prior assumptions about the consensus pattern other than its length. In particular no previous knowledge of the frequency or spacing of consensus patterns is required. However, a priori information about the shape of the consensus pattern, or invariability of individual positions, or the overall conservation level, can be utilized to enhance the selectivity and sensitivity of search. As the number of all possible consensus words increases very rapidly with length, comprehensive searches have usually been restricted to a maximum of 10-12 nucleotides, even when large mainframes are used. Our algorithm enables searching for consensus patterns of this order on current mid-range and powerful microcomputers. Searches may be conducted on single, long sequences or a set of possibly aligned shorter sequences. We give examples of identified consensus patterns in both prokaryotic and eukaryotic DNA sequences, along with some typical program timings.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A new method for finding long consensus patterns in nucleic acid sequences

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Journal: Bioinformatics	Publication Date: Jan 1, 1991
Citations: 3

Similar Papers

Genomic Signature in Evolutionary Biology: A Review
Rebeca De La Fuente ... Andrés Moya
Biology | VOL. 12
Rebeca De La Fuente, et. al.Rebeca De La Fuente ... Andrés Moya
16 Feb 2023
Biology | VOL. 12

Frequent Patterns Mining in DNA Sequence
Na Deng ... Desheng Li
IEEE Access | VOL. 7
Na Deng, et. al.Na Deng ... Desheng Li
01 Jan 2019
IEEE Access | VOL. 7

SIMD parallelization of the WORDUP algorithm for detecting statistically significant patterns in DNA sequences.
Sabino Liuni ... Nicola Prunella
Computer applications in the biosciences : CABIOS | VOL. 9
Sabino Liuni, et. al.Sabino Liuni ... Nicola Prunella
01 Jan 1992
Computer applications in the biosciences : CABIOS | VOL. 9

Using Suffix Tree to Discover Complex Repetitive Patterns in DNA Sequences
Dan He
-
Dan HeDan He
01 Aug 2006
01 Aug 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A new method for finding long consensus patterns in nucleic acid sequences

Abstract

Talk to us

Similar Papers

More From: Bioinformatics