SEGMENT: identifying compositional domains in DNA sequences.

J L Oliver, J Perez, R Roman-Roldan, P Bernaola-Galvan

doi:10.1093/bioinformatics/15.12.974

Abstract

DNA sequences are formed by patches, or domains, of different nucleotide composition. In a few simple sequences, domains can be identified by eye; however, most DNA sequences show a complex compositional heterogeneity (fractal structure), which cannot be properly detected by current methods. Recently, a computationally efficient segmentation method for analysing such nonstationary sequence structures, based on the Jensen-Shannon entropic divergence, has been described. Specific algorithms implementing this method are now needed. Here we describe a heuristic segmentation algorithm for DNA sequences, which was implemented in a Windows program (SEGMENT). The program divides a DNA sequence into compositionally homogeneous domains by iterating a local optimization procedure at a given statistical significance level. Once a sequence is partitioned into domains, a global measure of sequence compositional complexity (SCC), accounting for both the sizes and compositional biases of all the domains in the sequence, is derived. SEGMENT computes SCC as a function of the significance level, which provides a multiscale view of sequence complexity.
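The procedure outlined in the abstract can be illustrated with a minimal Python sketch. It is not the SEGMENT program and makes several assumptions that the abstract does not state: the function names (`entropy`, `best_cut`, `segment`, `scc`) are hypothetical; the statistical significance test is replaced by a plain threshold on the sequence length times the divergence; and SCC is assumed to be the generalised Jensen-Shannon divergence among the domains, that is, the entropy of the whole sequence minus the length-weighted entropies of its domains.

```python
from collections import Counter
from math import log2


def entropy(counts, total):
    """Shannon entropy in bits of a composition given as symbol counts."""
    if total == 0:
        return 0.0
    return -sum((c / total) * log2(c / total) for c in counts.values() if c > 0)


def best_cut(seq):
    """Return (position, divergence) for the cut that maximises the
    Jensen-Shannon divergence between the left and right subsequences.
    Position is None when no cut gives a positive divergence."""
    n = len(seq)
    total = Counter(seq)
    h_whole = entropy(total, n)
    left = Counter()
    best_pos, best_d = None, 0.0
    for i in range(1, n):
        left[seq[i - 1]] += 1   # running composition of seq[:i]
        right = total - left    # composition of seq[i:]
        d = (h_whole
             - (i / n) * entropy(left, i)
             - ((n - i) / n) * entropy(right, n - i))
        if d > best_d:
            best_pos, best_d = i, d
    return best_pos, best_d


def segment(seq, threshold, start=0):
    """Recursively split seq at its best cut while len(seq) * divergence
    reaches the threshold. ASSUMPTION: this threshold is a stand-in for
    the significance test used by the published method.
    Returns a list of (start, end) domains."""
    if len(seq) >= 2:
        pos, d = best_cut(seq)
        if pos is not None and len(seq) * d >= threshold:
            return (segment(seq[:pos], threshold, start)
                    + segment(seq[pos:], threshold, start + pos))
    return [(start, start + len(seq))]


def scc(seq, domains):
    """ASSUMED definition of SCC: entropy of the whole sequence minus the
    length-weighted sum of the domain entropies."""
    n = len(seq)
    h_parts = sum(((e - s) / n) * entropy(Counter(seq[s:e]), e - s)
                  for s, e in domains)
    return entropy(Counter(seq), n) - h_parts
```

Calling `segment(seq, threshold)` for a range of threshold values and passing each result to `scc` gives a rough analogue of the multiscale view described above, with the threshold playing the role of the significance level. The recursion is kept deliberately simple and is intended for short illustrative sequences only.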

Journal: Bioinformatics (Oxford, England)
Publication Date: Dec 1, 1999
Citations: 60

Similar Papers

Elucidation of tRNA-dependent editing by a class II tRNA synthetase and significance for cell viability.
Kirk Beebe
The EMBO Journal | VOL. 22
03 Feb 2003

LWPQ: an in silico designed multi-core super-agonist motif-like regulatory peptide for the activation of human stem cell transcripts using an integrated computational approach
N Grigoriadis ... I Grigoriadis
Cytotherapy | VOL. 16
12 Mar 2014

Cloning of transmembrane domain sequence of EGFR gene
Wen-Xue Ma ... Jie Yan
Zhejiang da xue xue bao. Yi xue ban = Journal of Zhejiang University. Medical sciences | VOL. 31
01 Aug 2002

Glossary
Fran Lewitter ... Janet M Thornton
Trends in Biotechnology | VOL. 16
01 Nov 1998
