The twilight zone of cis element alignments

Alvaro Sebastian, Bruno Contreras-Moreira

doi:10.1093/nar/gks1301

Abstract

Sequence alignment of proteins and nucleic acids is a routine task in bioinformatics. Although the comparison of complete peptides, genes or genomes can be undertaken with a great variety of tools, the alignment of short DNA sequences and motifs entails pitfalls that have not yet been fully addressed. Here we confront the structural superposition of transcription factors with the sequence alignment of their recognized cis elements. Our goals are (i) to test TFcompare (http://floresta.eead.csic.es/tfcompare), a structural alignment method for protein–DNA complexes; (ii) to benchmark the pairwise alignment of regulatory elements; (iii) to define the confidence limits and the twilight zone of such alignments and (iv) to evaluate the relevance of these thresholds with elements obtained experimentally. We find that the structure of cis elements and protein–DNA interfaces is significantly more conserved than their sequence, and we measure how this correlates with alignment errors when only sequence information is considered. Our results confirm that DNA motifs in the form of matrices produce better alignments than individual sequences. Finally, we report that empirical and theoretically derived twilight thresholds are useful for estimating the natural plasticity of regulatory sequences, and hence for filtering out unreliable alignments.
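To make the contrast between aligning individual sequences and aligning matrices concrete, here is a minimal Python sketch of an ungapped pairwise alignment of two cis elements represented as per-position base frequency matrices. It is an illustration under stated assumptions, not the method used by TFcompare or by the authors: the column score (Pearson correlation), the minimum overlap of 3 columns, and the names `column_similarity`, `one_hot` and `best_ungapped_alignment` are all hypothetical choices made for this example. A single sequence is treated as a degenerate matrix with one base at frequency 1 per column. The reverse strand is not considered.

```python
import math

BASES = "ACGT"


def column_similarity(col_a, col_b):
    """Pearson correlation between two columns of A, C, G, T frequencies."""
    mean_a = sum(col_a) / len(col_a)
    mean_b = sum(col_b) / len(col_b)
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(col_a, col_b))
    var_a = sum((a - mean_a) ** 2 for a in col_a)
    var_b = sum((b - mean_b) ** 2 for b in col_b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a uniform column carries no signal
    return cov / math.sqrt(var_a * var_b)


def one_hot(seq):
    """Represent a single DNA sequence as a matrix with one base per column."""
    return [[1.0 if base == b else 0.0 for b in BASES] for base in seq.upper()]


def best_ungapped_alignment(m1, m2, min_overlap=3):
    """Slide m2 along m1 and return (score, offset) of the best placement.

    offset is the index in m1 where column 0 of m2 lands; it can be negative
    when m2 overhangs the left edge of m1. Returns None if no placement
    reaches min_overlap columns.
    """
    best = None
    for offset in range(-(len(m2) - min_overlap), len(m1) - min_overlap + 1):
        start = max(0, offset)
        end = min(len(m1), offset + len(m2))
        if end - start < min_overlap:
            continue
        score = sum(
            column_similarity(m1[i], m2[i - offset]) for i in range(start, end)
        )
        if best is None or score > best[0]:
            best = (score, offset)
    return best


if __name__ == "__main__":
    site = one_hot("TTGACGTCA")   # made-up example site
    core = one_hot("GACGT")       # made-up shorter element
    print(best_ungapped_alignment(site, core))
```

With real motifs, each column would hold observed base frequencies rather than a single 1.0, so partially conserved positions still contribute to the score. That extra signal is one plausible reason why matrices can align more reliably than single sequences.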

Journal: Nucleic Acids Research
Publication Date: Dec 24, 2012
Citations: 11
License type: CC BY-NC 3.0

Similar Papers

Pairwise DNA Alignment with Sequence Specific Transition-Transversion Ratio Using Multiple Parameter Sets
Ankit Agrawal ... Xiaoqiu Huang
01 Dec 2008

Sequence-specific High Mobility Group Box Factors Recognize 10–12-Base Pair Minor Groove Motifs
Moniek Van Beest ... Hans Clevers
Journal of Biological Chemistry | VOL. 275
01 Sep 2000

Methods to define and locate patterns of motifs in sequences.
Rodger Staden
Computer applications in the biosciences : CABIOS | VOL. 4
01 Jan 1987

DNAlignTT: Pairwise DNA alignment with sequence specific transition-transversion ratio
Ankit Agrawal ... Xiaoqiu Huang
01 May 2008
