Abstract

Recent experimental and computational progress has revealed a large potential for RNA structure in the genome. This has been driven by computational strategies that exploit multiple genomes of related organisms to identify common sequences and secondary structures. However, these computational approaches have two main challenges: they are computationally expensive and they have a relatively high false discovery rate (FDR). Simultaneously, RNA 3D structure analysis has revealed modules composed of non-canonical base pairs which occur in non-homologous positions, apparently by independent evolution. These modules can, for example, occur inside structural elements which in RNA 2D predictions appear as internal loops. Hence one question is if the use of such RNA 3D information can improve the prediction accuracy of RNA secondary structure at a genome-wide level. Here, we use RNAz in combination with 3D module prediction tools and apply them on a 13-way vertebrate sequence-based alignment. We find that RNA 3D modules predicted by metaRNAmodules and JAR3D are significantly enriched in the screened windows compared to their shuffled counterparts. The initially estimated FDR of 47.0% is lowered to below 25% when certain 3D module predictions are present in the window of the 2D prediction. We discuss the implications and prospects for further development of computational strategies for detection of RNA 2D structure in genomic sequence.

Highlights

  • Recent studies have shown a large potential for RNA structure in eukaryotes through experimental strategies [1,2,3,4] as well as computational strategies [5]

  • By inspection of the RMDetect score distributions we pragmatically find that the top 25% high scoring 3D modules on the original data make up a tail in the score distribution that separates from low-scoring and from the score distribution over the shuffled windows

  • To investigate the relationship between the 2D and 3D predictions we address the following: (1) whether 3D modules are enriched in the original windows as opposed to the shuffled ones; (2) whether individual 3D modules can be seen to be enriched; (3) even more importantly, we investigate if these models have a collective effect in lowering the false discovery rate (FDR) compared to RNAz alone

Read more

Summary

Introduction

Recent studies have shown a large potential for RNA structure in eukaryotes through experimental strategies [1,2,3,4] as well as computational strategies [5]. The computational strategies were developed to search for novel RNA secondary structure by making use of comparative. Transcriptome” (CHzS, IH) www.fwf.ac.at; Department of Health and Human Services, National Institutes of Health, NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES (2R01GM085328-05 to CZ) http://nih.gov/. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call