Abstract
This paper is a report of a second round of RNA-Puzzles, a collective and blind experiment in three-dimensional (3D) RNA structure prediction. Three puzzles, Puzzles 5, 6, and 10, represented sequences of three large RNA structures with limited or no homology with previously solved RNA molecules. A lariat-capping ribozyme, as well as riboswitches complexed to adenosylcobalamin and tRNA, were predicted by seven groups using RNAComposer, ModeRNA/SimRNA, Vfold, Rosetta, DMD, MC-Fold, 3dRNA, and AMBER refinement. Some groups derived models using data from state-of-the-art chemical-mapping methods (SHAPE, DMS, CMCT, and mutate-and-map). The comparisons between the predictions and the three subsequently released crystallographic structures, solved at diffraction resolutions of 2.5–3.2 Å, were carried out automatically using various sets of quality indicators. The comparisons clearly demonstrate the state of present-day de novo prediction abilities as well as the limitations of these state-of-the-art methods. All of the best prediction models have similar topologies to the native structures, which suggests that computational methods for RNA structure prediction can already provide useful structural information for biological problems. However, the prediction accuracy for non-Watson–Crick interactions, key to proper folding of RNAs, is low and some predicted models had high Clash Scores. These two difficulties point to some of the continuing bottlenecks in RNA structure prediction. All submitted models are available for download at http://ahsoka.u-strasbg.fr/rnapuzzles/.
Highlights
More than 100,000 structures are currently available in the Protein Data Bank (PDB) (Berman et al 2000); RNA-containing structures take up
We report a second round focusing on the prediction of large RNA structures, a lariat-capping ribozyme, an adenosylcobalamin-binding riboswitch, and a T-box–tRNA complex (Peselis and Serganov 2012; Zhang and Ferre-D’Amare 2013; Meyer et al 2014)
To geometrically compare predicted models with the experimental structures, we used the Root Mean Square Deviation (RMSD) measure, the Deformation Index (DI), and the complete Deformation Profile matrix (DP) which provides an evaluation of the predictive quality of a model at multiple scales (Parisien et al 2009)
Summary
More than 100,000 structures are currently available in the Protein Data Bank (PDB) (Berman et al 2000); RNA-containing structures take up
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have