Abstract

RNA molecules participate in many important biological processes, and they must fold into well-defined secondary and tertiary structures to perform their functions. Like the well-known protein folding problem, there is also an RNA folding problem, which has two aspects: structure prediction and folding mechanism. Although the former has been widely studied, the latter is still not well understood. Here we present a deep reinforcement learning algorithm, 2dRNA-Fold, to study the fastest folding paths of RNA secondary structure. Given an RNA sequence, 2dRNA-Fold uses a neural network combined with Monte Carlo tree search to select residue pairings step by step until the final secondary structure is formed. We apply 2dRNA-Fold to several short RNA molecules and one longer RNA, 1Y26, and find that their fastest folding paths show some interesting features. 2dRNA-Fold is further trained on a set of RNA molecules from the bpRNA dataset and used to predict RNA secondary structure. Because 2dRNA-Fold scores each candidate next step on the basis of possible base pairings, the learned or predicted fastest folding path may not agree with the actual folding paths determined by free energy according to physical laws.
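The step-by-step selection of base pairs described in the abstract can be pictured with a small sketch. This is not the authors' neural network or Monte Carlo tree search: the `score` function below is a hypothetical stand-in heuristic (prefer pairs that stack on an existing pair, then pairs that close shorter loops), and the canonical pair set, the minimum loop length of 3, and the exclusion of crossing pairs are all assumptions made for illustration. It only shows how a folding path can be treated as a sequence of pairing decisions.

```python
from typing import List, Set, Tuple

Pair = Tuple[int, int]

# Assumed pairing rules: Watson-Crick pairs plus the G-U wobble pair.
CANONICAL = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}
MIN_LOOP = 3  # assumed minimum number of unpaired bases enclosed by a pair


def legal_pairs(seq: str, paired: Set[Pair]) -> List[Pair]:
    """List every pair (i, j) that can still be added to the current state."""
    used = {k for p in paired for k in p}
    moves = []
    for i in range(len(seq)):
        for j in range(i + MIN_LOOP + 1, len(seq)):
            if i in used or j in used:
                continue
            if (seq[i], seq[j]) not in CANONICAL:
                continue
            # Reject pairs that would cross an existing pair (no pseudoknots).
            if any(a < i < b < j or i < a < j < b for a, b in paired):
                continue
            moves.append((i, j))
    return moves


def score(pair: Pair, paired: Set[Pair]) -> Tuple[int, int]:
    """Hypothetical heuristic: prefer stacking, then shorter loops."""
    i, j = pair
    stacks = int((i + 1, j - 1) in paired or (i - 1, j + 1) in paired)
    return (stacks, -(j - i))


def greedy_path(seq: str) -> List[Pair]:
    """Add the best-scoring legal pair at each step until none remain."""
    paired: Set[Pair] = set()
    path: List[Pair] = []
    while True:
        moves = legal_pairs(seq, paired)
        if not moves:
            return path
        best = max(moves, key=lambda p: score(p, paired))
        paired.add(best)
        path.append(best)


def to_dot_bracket(n: int, pairs: List[Pair]) -> str:
    """Render a set of pairs in dot-bracket notation."""
    out = ["."] * n
    for i, j in pairs:
        out[i], out[j] = "(", ")"
    return "".join(out)


seq = "GGGAAAACCC"
path = greedy_path(seq)
print(path)                            # [(2, 7), (1, 8), (0, 9)]
print(to_dot_bracket(len(seq), path))  # (((....)))
```

For this toy hairpin the heuristic closes the loop first and then adds the two stacked pairs outward. A learned policy, as in 2dRNA-Fold, would replace `score` with a trained model and the greedy choice with a tree search; both replacements are outside the scope of this sketch.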

Highlights

  • More and more studies have shown that ribonucleic acid (RNA) participates in many important biological processes [1,2,3,4]

  • We shall focus on the problem of the RNA folding pathway in secondary structure formation

  • From the perspective of exploring RNA folding path in secondary structure formation, here we propose an RNA folding path learning algorithm based on deep reinforcement learning, called 2dRNA-Fold


Summary

Introduction

More and more studies have shown that ribonucleic acid (RNA) participates in many important biological processes [1,2,3,4]. RNA molecules need to fold into well-defined secondary and tertiary structures to perform their functions. The “RNA folding problem” comprises two main problems: RNA structure prediction and folding mechanism. The former deals with determining the secondary and tertiary structures of an RNA directly from its sequence and has been widely studied [6,7,8,9,10,11,12,13,14,15]. We shall focus on the problem of the RNA folding pathway in secondary structure formation.

