Fast comparison of DNA sequences by oligonucleotide profiling

Vicente Arnau,Ignacio Marín,Miguel Gallach

doi:10.1186/1756-0500-1-5

Abstract

BackgroundThe comparison of DNA sequences is a traditional problem in genomics and bioinformatics. Many new opportunities emerge due to the improvement of personal computers, allowing the implementation of novel strategies of analysis.FindingsWe describe a new program, called UVWORD, which determines the number of times that each DNA word present in a sequence (target) is found in a second sequence (source), a procedure that we have called oligonucleotide profiling. On a standard computer, the user may search for words of a size ranging from k = 1 to k = 14 nucleotides. Average counts for groups of contiguous words may also be established. The rate of analysis on standard computers is from 3.4 (k = 14) to 16 millions of words per second (1 ≤ k ≤ 8). This makes feasible the fast screening of even the longest known DNA molecules.DiscussionWe show that the combination of the ability of analyzing words of relatively long size, which occur very rarely by chance, and the fast speed of the program allows to perform novel types of screenings, complementary to those provided by standard programs such as BLAST. This method can be used to determine oligonucleotide content, to characterize the distribution of repetitive sequences in chromosomes, to determine the evolutionary conservation of sequences in different species, to establish regions of similar DNA among chromosomes or genomes, etc.

Highlights

The comparison of DNA sequences is a traditional problem in genomics and bioinformatics
We show that the combination of the ability of analyzing words of relatively long size, which occur very rarely by chance, and the fast speed of the program allows to perform novel types of screenings, complementary to those provided by standard programs such as BLAST
Oligonucleotide profiling using UVWORD Here we describe a new program, UVWORD, which implements a strategy of analysis that we have called oligonucleotide profiling

Summary

Discussion

We show that the combination of the ability of analyzing words of relatively long size, which occur very rarely by chance, and the fast speed of the program allows to perform novel types of screenings, complementary to those provided by standard programs such as BLAST. This method can be used to determine oligonucleotide content, to characterize the distribution of repetitive sequences in chromosomes, to determine the evolutionary conservation of sequences in different species, to establish regions of similar DNA among chromosomes or genomes, etc

Findings

Discussion and conclusion

Kent WJ

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Research Notes	Publication Date: Feb 28, 2008
Citations: 30	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Fast comparison of DNA sequences by oligonucleotide profiling

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Research Notes

Lead the way for us

Similar Papers

Discovering dependencies via algorithmic mutual information: A case study in DNA sequence comparisons
Aleksandar Milosavljević
Machine Learning | VOL. 21
Aleksandar MilosavljevićAleksandar Milosavljević
01 Oct 1995
Machine Learning | VOL. 21

6 Pezizomycotina: Dothideomycetes and Arthoniomycetes
Conrad Schoch ... Martin Grube
-
Conrad Schoch, et. al.Conrad Schoch ... Martin Grube
01 Jan 2015
01 Jan 2015

A NOTE ON COMPLEXITY OF GENETIC MUTATIONS
Bhadrachalam Chitturi
Discrete Mathematics, Algorithms and Applications | VOL. 03
Bhadrachalam ChitturiBhadrachalam Chitturi
01 Sep 2011
Discrete Mathematics, Algorithms and Applications | VOL. 03

Survey of Biological High Performance Computing: Algorithms, Implementations and Outlook Research
Nasreddine Hireche ... J.M Langlois
-
Nasreddine Hireche, et. al.Nasreddine Hireche ... J.M Langlois
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fast comparison of DNA sequences by oligonucleotide profiling

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Research Notes