Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.

Michael F Sloma,David H Mathews

doi:10.1371/journal.pcbi.1005827

Michael F Sloma, David H Mathews

Open Access

https://doi.org/10.1371/journal.pcbi.1005827

Copy DOI

Abstract

Prediction of RNA tertiary structure from sequence is an important problem, but generating accurate structure models for even short sequences remains difficult. Predictions of RNA tertiary structure tend to be least accurate in loop regions, where non-canonical pairs are important for determining the details of structure. Non-canonical pairs can be predicted using a knowledge-based model of structure that scores nucleotide cyclic motifs, or NCMs. In this work, a partition function algorithm is introduced that allows the estimation of base pairing probabilities for both canonical and non-canonical interactions. Pairs that are predicted to be probable are more likely to be found in the true structure than pairs of lower probability. Pair probability estimates can be further improved by predicting the structure conserved across multiple homologous sequences using the TurboFold algorithm. These pairing probabilities, used in concert with prior knowledge of the canonical secondary structure, allow accurate inference of non-canonical pairs, an important step towards accurate prediction of the full tertiary structure. Software to predict non-canonical base pairs and pairing probabilities is now provided as part of the RNAstructure software package.

Highlights

RNA tertiary structure predictionRNA plays a central role in all of life, acting as both a carrier of genetic information and as an active participant in numerous cellular processes, including pre-mRNA splicing and gene regulation via modulation of transcription and translation [1]
We developed a new method, CycleFold, that can identify non-canonical base pairs using statistical methods that have proven successful in predicting A-form helices
CycleFold provides a dramatic improvement in accuracy over previously available methods, and its output could be used to refine three dimensional structure predictions from any modeling software

Summary

Introduction

RNA tertiary structure predictionRNA plays a central role in all of life, acting as both a carrier of genetic information and as an active participant in numerous cellular processes, including pre-mRNA splicing and gene regulation via modulation of transcription and translation [1]. A wide range of computational methods were developed to automatically predict RNA structure These methods include fragment assembly [11,12,13,14], all-atom modeling with constraints from sequence comparison and experimental information using molecular mechanics [15,16], coarse-grained molecular simulation [17,18,19,20,21,22], coarse-grained helix-as-a-stick models [23,24,25], and homology modeling using a known structure [26]. All of these methods can use sparse information from experimental methods or low-resolution computational structure prediction, such as prediction of secondary structure, to reduce the search space of the tertiary structure problem and improve accuracy

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLOS Computational Biology	Publication Date: Nov 6, 2017
Citations: 27	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Computational Biology

Lead the way for us

Similar Papers

Conformational specificity of non-canonical base pairs and higher order structures in nucleic acids: crystal structure database analysis
Shayantani Mukherjee ... Dhananjay Bhattacharyya
Journal of Computer-Aided Molecular Design | VOL. 20
Shayantani Mukherjee, et. al.Shayantani Mukherjee ... Dhananjay Bhattacharyya
24 Nov 2006
Journal of Computer-Aided Molecular Design | VOL. 20

Computational methods toward accurate RNA structure prediction using coarse-grained and all-atom models.
Andrey Krokhotin ... Nikolay V Dokholyan
Methods in enzymology | VOL. 553
Andrey Krokhotin, et. al.Andrey Krokhotin ... Nikolay V Dokholyan
01 Jan 2015
Methods in enzymology | VOL. 553

TurboFold: Iterative probabilistic estimation of secondary structures for multiple RNA sequences
Arif O Harmanci ... David H Mathews
BMC Bioinformatics | VOL. 12
Arif O Harmanci, et. al.Arif O Harmanci ... David H Mathews
20 Apr 2011
BMC Bioinformatics | VOL. 12

Automated de novo prediction of native-like RNA tertiary structures.
Rhiju Das ... David Baker
Proceedings of the National Academy of Sciences | VOL. 104
Rhiju Das, et. al.Rhiju Das ... David Baker
11 Sep 2007
Proceedings of the National Academy of Sciences | VOL. 104

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Computational Biology