Abstract

BackgroundDNA triplexes can naturally occur, co-localize and interact with many other regulatory DNA elements (e.g. G-quadruplex (G4) DNA motifs), specific DNA-binding proteins (e.g. transcription factors (TFs)), and micro-RNA (miRNA) precursors. Specific genome localizations of triplex target DNA sites (TTSs) may cause abnormalities in a double-helix DNA structure and can be directly involved in some human diseases. However, genome localization of specific TTSs, their interconnection with regulatory DNA elements and physiological roles in a cell are poor defined. Therefore, it is important to identify comprehensive and reliable catalogue of specific potential TTSs (pTTSs) and their co-localization patterns with other regulatory DNA elements in the human genome.Results"TTS mapping" database is a web-based search engine developed here, which is aimed to find and annotate pTTSs within a region of interest of the human genome. The engine provides descriptive statistics of pTTSs in a given region and its sequence context. Different annotation tracks of TTS-overlapping gene region(s), G4 motifs, CpG Island, miRNA precursors, miRNA targets, transcription factor binding sites (TFBSs), Single Nucleotide Polymorphisms (SNPs), small nucleolar RNAs (snoRNA), and repeat elements are also mapped based onto a sequence location provided by UCSC genome browser, G4 database http://www.quadruplex.org and several other datasets. The results pages provide links to UCSC genome browser annotation tracks and relative DBs. BLASTN program was included to check the uniqueness of a given pTTS in the human genome. Recombination- and mutation-prone genes (e.g. EVI-1, MYC) were found to be significantly enriched by TTSs and multiple co-occurring with our regulatory DNA elements. TTS mapping reveals that a high-complementary and evolutionarily conserved polypurine and polypyrimidine DNA sequence pair linked by a non-conserved short DNA sequence can form miR-483 transcribed from intron 2 of IGF2 gene and bound double-strand nucleic acid TTSs forming natural triplex structures.ConclusionTTS mapping provides comprehensive visual and analytical tools to help users to find pTTSs, G-quadruplets and other regulatory DNA elements in various genome regions. TTS Mapping not only provides sequence visualization and statistical information, but also integrates knowledge about co-localization TTS with various DNA elements and facilitates that data analysis. In particular, TTS Mapping reveals complex structural-functional regulatory module of gene IGF2 including TF MZF1 binding site and ncRNA precursor mir-483 formed by the high-complementary and evolutionarily conserved polypurine- and polypyrimidine-rich DNA pair. Such ncRNAs capable of forming helical triplex structures with a polypurine strand of a nucleic acid duplexes (DNA or RNA) via Hoogsteen or reverse Hoogsteen hydrogen bonds. Our web tool could be used to discover biologically meaningful genome modules and to optimize experimental design of anti-gene treatment.

Highlights

  • It is known that nucleic acids are flexible molecules that can exist in one of numerous spatial and temporal conformations

  • target DNA sites (TTSs) Mapping reveals complex structural-functional regulatory module of gene IGF2 including transcription factor (TF) MZF1 binding site and non-coding RNAs (ncRNAs) precursor mir-483 formed by the high-complementary and evolutionarily conserved polypurine- and polypyrimidine-rich DNA pair

  • We suggest that the TTSs could form triplexes in and TTS-rich regions included into intron 2 of IGF2 gene and, TTS-TFO complex might be involve in mechanisms of local epigenetic regulation of repressive heterochromatin directly or/and through the recruitment of specific proteins

Read more

Summary

Introduction

It is known that nucleic acids are flexible molecules that can exist in one of numerous spatial and temporal conformations. The existence of triple-stranded nucleic acids, usually named ‘triplexes’, was first demonstrated by Felsenfeld and collaborators [1] This conformation is obtained by the insertion of a third strand inside the major groove of a double-stranded (duplex) nucleic acid (Figure 1). The most stable triads are those involving a protonated cytosine (C+) forming C+*GC triad and T*AT nucleotides (the asterisks mark the nucleotides on the third strand) These sequences are overrepresented in all eukaryotic genomes, as well as in eukaryotic viruses [2,3]. This schema shows how the third strand of a nucleic acid can form a triplex structure with a polypurine strand of a nucleic acid duplex (or TTS) via Hoogsteen or reverse Hoogsteen hydrogen bond. It is important to identify comprehensive and reliable catalogue of specific potential TTSs (pTTSs) and their co-localization patterns with other regulatory DNA elements in the human genome

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call