Multiple Sequence Alignments Enhance Boundary Definition of RNA Structures.

Radhakrishnan Sabarinathan, Christian Anthon, Stefan Seemann, Jan Gorodkin

doi:10.3390/genes9120604


Open Access

https://doi.org/10.3390/genes9120604


Abstract

Self-contained structured domains of RNA sequences often have distinct molecular functions. Determining the boundaries of structured domains of a non-coding RNA (ncRNA) is needed for many ncRNA gene finder programs that predict RNA secondary structures in aligned genomes, because these methods do not necessarily provide precise information about the boundaries or the location of the RNA structure inside the predicted ncRNA. Even without a structure prediction, it is of interest to search for structured domains, for example to find common RNA motifs in RNA-protein binding assays. A precise definition of the boundaries is essential for downstream analyses such as RNA structure modelling, e.g., through covariance models, and RNA structure clustering for the search of common motifs. Such efforts have so far focused on single sequences; here we present a comparison of boundary definition between single sequences and multiple sequence alignments. We also present a novel approach, named RNAbound, for finding the boundaries, which is based on probabilities of evolutionarily conserved base pairings. We tested the performance of two different methods on a limited number of Rfam families using the annotated structured RNA regions in the human genome and their multiple sequence alignments created from 14 species. The results show that multiple sequence alignments improve the boundary prediction for branched structures compared to single sequences, independent of the chosen method. The actual performance of the two methods differs between single hairpin structures and branched structures. For the RNA families with branched structures, including transfer RNA (tRNA) and small nucleolar RNAs (snoRNAs), RNAbound improves the boundary predictions using multiple sequence alignments to median differences of −6 and −11.5 nucleotides (nts) for the left and right boundary, respectively (window size of 200 nts).

Highlights

The function of RNA is often guided by its structural conformation, which is in turn determined by its sequence composition
The results show that multiple sequence alignments improve the boundary prediction for branched structures compared to single sequences independent of the chosen method
The three components can be described by scores based on base pairing probabilities: (1) score I_bp is the geometric mean of paired probabilities between bases inside [k, l]; (2) score O_¬bp is the geometric mean of unpaired probabilities of bases inside [k, l] to bases outside of [k, l]; and (3) score
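
As a rough illustration, the first two scores listed above could be computed from a base pairing probability matrix along the following lines. This is a minimal sketch under stated assumptions, not the RNAbound implementation: it assumes that each score is a geometric mean taken per base over the window [k, l], that "paired inside" means the summed probability of pairing with another base in the window, and that "unpaired to outside" means one minus the summed probability of pairing with a base outside the window. The function names, the clamping constant `eps`, and the toy matrix are all hypothetical. The third score is truncated in the text above and is not attempted here.

```python
import math


def geometric_mean(values, eps=1e-9):
    """Geometric mean; values are clamped to eps so that log(0) never occurs."""
    return math.exp(sum(math.log(max(v, eps)) for v in values) / len(values))


def score_paired_inside(P, k, l):
    """Assumed form of score (1): for each base i in [k, l], sum the probability
    of pairing with another base in [k, l], then take the geometric mean."""
    inside = range(k, l + 1)
    return geometric_mean(
        [sum(P[i][j] for j in inside if j != i) for i in inside]
    )


def score_unpaired_outside(P, k, l):
    """Assumed form of score (2): for each base i in [k, l], take one minus the
    probability of pairing with any base outside [k, l], then the geometric mean."""
    outside = [j for j in range(len(P)) if j < k or j > l]
    return geometric_mean(
        [1.0 - sum(P[i][j] for j in outside) for i in range(k, l + 1)]
    )


def toy_matrix(n=14, background=0.004):
    """Hypothetical symmetric pairing matrix: a two-pair stem closing the
    region 3..10 (0-based), plus a weak background for distant positions."""
    P = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 4, n):
            P[i][j] = P[j][i] = background
    for i, j, p in [(3, 10, 0.90), (4, 9, 0.85)]:
        P[i][j] = P[j][i] = p
    return P


P = toy_matrix()
for k, l in [(3, 10), (0, 7)]:
    print((k, l), score_paired_inside(P, k, l), score_unpaired_outside(P, k, l))
```

In this toy setting the window (3, 10), which encloses the whole stem, scores higher on both measures than the shifted window (0, 7), which cuts through the stem. In practice the matrix would come from a partition function calculation on a single sequence or an alignment rather than being written by hand.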

Summary

Introduction

The function of RNA is often guided by its structural conformation, which is in turn determined by its sequence composition. Long non-coding RNAs (lncRNAs) can contain local functional structures, e.g., lncRNA GAS5 forms a secondary structure that binds the. The definition of RNA structure domains has been addressed at the single sequence level, first explicitly by Dotu et al. [2]. They described a fitness function for all segmentations of subwords of a sequence based on the base pairing probability matrix. These matrices are usually calculated from the respective sequence by McCaskill's partition function approach [3].

Methods

Results

Conclusion


Journal: Genes	Publication Date: Dec 4, 2018
Citations: 2	License type: CC BY 4.0
