On the combinatorics of sparsification

Fenix Wd Huang,Christian M Reidys

doi:10.1186/1748-7188-7-28

Fenix Wd Huang, Christian M Reidys

Open Access

https://doi.org/10.1186/1748-7188-7-28

Copy DOI

Abstract

BackgroundWe study the sparsification of dynamic programming based on folding algorithms of RNA structures. Sparsification is a method that improves significantly the computation of minimum free energy (mfe) RNA structures.ResultsWe provide a quantitative analysis of the sparsification of a particular decomposition rule, Λ∗. This rule splits an interval of RNA secondary and pseudoknot structures of fixed topological genus. Key for quantifying sparsifications is the size of the so called candidate sets. Here we assume mfe-structures to be specifically distributed (see Assumption 1) within arbitrary and irreducible RNA secondary and pseudoknot structures of fixed topological genus. We then present a combinatorial framework which allows by means of probabilities of irreducible sub-structures to obtain the expectation of the Λ∗-candidate set w.r.t. a uniformly random input sequence. We compute these expectations for arc-based energy models via energy-filtered generating functions (GF) in case of RNA secondary structures as well as RNA pseudoknot structures. Furthermore, for RNA secondary structures we also analyze a simplified loop-based energy model. Our combinatorial analysis is then compared to the expected number of Λ∗-candidates obtained from the folding mfe-structures. In case of the mfe-folding of RNA secondary structures with a simplified loop-based energy model our results imply that sparsification provides a significant, constant improvement of 91% (theory) to be compared to an 96% (experimental, simplified arc-based model) reduction. However, we do not observe a linear factor improvement. Finally, in case of the “full” loop-energy model we can report a reduction of 98% (experiment).ConclusionsSparsification was initially attributed a linear factor improvement. This conclusion was based on the so called polymer-zeta property, which stems from interpreting polymer chains as self-avoiding walks. Subsequent findings however reveal that the O(n) improvement is not correct. The combinatorial analysis presented here shows that, assuming a specific distribution (see Assumption 1), of mfe-structures within irreducible and arbitrary structures, the expected number of Λ∗-candidates is Θ(n2). However, the constant reduction is quite significant, being in the range of 96%. We furthermore show an analogous result for the sparsification of the Λ∗-decomposition rule for RNA pseudoknotted structures of genus one. Finally we observe that the effect of sparsification is sensitive to the employed energy model.

Highlights

We study the sparsification of dynamic programming based on folding algorithms of RNA structures
Sparsification was initially attributed a linear factor improvement. This conclusion was based on the so called polymer-zeta property, which stems from interpreting polymer chains as self-avoiding walks
We show an analogous result for the sparsification of the ∗-decomposition rule for RNA pseudoknotted structures of genus one

Summary

Conclusions

Sparsification was initially attributed a linear factor improvement. This conclusion was based on the so called polymer-zeta property, which stems from interpreting polymer chains as self-avoiding walks. Subsequent findings reveal that the O(n) improvement is not correct. The combinatorial analysis presented here shows that, assuming a specific distribution (see Assumption 1), of mfe-structures within irreducible and arbitrary structures, the expected number of ∗-candidates is (n2). The constant reduction is quite significant, being in the range of 96%. We show an analogous result for the sparsification of the ∗-decomposition rule for RNA pseudoknotted structures of genus one. We observe that the effect of sparsification is sensitive to the employed energy model

Background

Methods

Conclusion

13. Zuker M

17. Akutsu T

27. Waterman MS

33. McCaskill JS

Findings

41. Zagier D

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Algorithms for Molecular Biology	Publication Date: Oct 22, 2012
Citations: 29	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

On the combinatorics of sparsification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Algorithms for Molecular Biology

Lead the way for us

Similar Papers

PseudoViewer: web application and web service for visualizing RNA pseudoknots and secondary structures
Y Byun ... K Han
Nucleic Acids Research | VOL. 34
Y Byun, et. al.Y Byun ... K Han
01 Jul 2006
Nucleic Acids Research | VOL. 34

Local Connectivity of Neutral Networks
Christian M Reidys
Bulletin of Mathematical Biology | VOL. 71
Christian M ReidysChristian M Reidys
30 Dec 2009
Bulletin of Mathematical Biology | VOL. 71

Inverse folding of RNA pseudoknot structures
James Zm Gao ... Linda Ym Li
Algorithms for Molecular Biology | VOL. 5
James Zm Gao, et. al.James Zm Gao ... Linda Ym Li
23 Jun 2010
Algorithms for Molecular Biology | VOL. 5

Using SHAPE-MaP To Model RNA Secondary Structure and Identify 3'UTR Variation in Chikungunya Virus.
Emily A Madden ... Rebecca Ellis Dutch
Journal of Virology | VOL. 94
Emily A Madden, et. al.Emily A Madden ... Rebecca Ellis Dutch
23 Nov 2020
Journal of Virology | VOL. 94

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On the combinatorics of sparsification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Algorithms for Molecular Biology