Predicting RNA secondary structure via adaptive deep recurrent neural networks with energy-based filter

Weizhong Lu, Jing Qiu, Hongmei Huang, Haiou Li, Ye Tang, Qiming Fu, Hongjie Wu

doi:10.1186/s12859-019-3258-7

Abstract

Background: RNA secondary structure prediction is an important issue in structural bioinformatics, and RNA pseudoknotted secondary structure prediction represents an NP-hard problem. Recently, many different machine-learning methods, Markov models, and neural networks have been employed for this problem, with encouraging results regarding their predictive accuracy; however, their performance is usually limited by over-fitting and by the learning model's requirement for a fixed number of training features. Because most natural biological sequences have variable lengths, the sequences have to be truncated before the features are employed by the learning model, which not only leads to the loss of information but also destroys biological-sequence integrity.

Results: To address this problem, we propose a deep-learning model that adapts to sequence length and integrate an energy-based filter to remove over-fitted base pairs.

Conclusions: Comparative experiments conducted on the authoritative dataset RNA STRAND (RNA secondary STRucture and statistical Analysis Database) showed 12% higher accuracy than three currently used methods.
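The difference between truncating to a fixed length and keeping variable-length input can be illustrated with a small sketch. This is not the encoding used in the paper: the one-hot scheme, the function names, and the padding-plus-lengths representation are assumptions chosen only to show where truncation loses bases.

```python
# Illustrative sketch only: names and the padding scheme are assumptions,
# not the encoding used in the paper.
BASES = "ACGU"

def one_hot(seq):
    """Encode each base as a 4-element one-hot vector (unknown bases become all zeros)."""
    return [[1.0 if b == base else 0.0 for base in BASES] for b in seq.upper()]

def truncate_batch(seqs, fixed_len):
    """Fixed-length input: cut every sequence to fixed_len and zero-pad shorter ones."""
    batch = []
    for s in seqs:
        enc = one_hot(s[:fixed_len])
        enc += [[0.0] * 4] * (fixed_len - len(enc))
        batch.append(enc)
    return batch

def pad_batch(seqs):
    """Variable-length input: pad to the longest sequence and return the true lengths."""
    max_len = max(len(s) for s in seqs)
    batch, lengths = [], []
    for s in seqs:
        enc = one_hot(s)
        lengths.append(len(enc))
        enc += [[0.0] * 4] * (max_len - len(enc))
        batch.append(enc)
    return batch, lengths

seqs = ["GGGAAACCC", "GCAU"]
truncated = truncate_batch(seqs, fixed_len=6)  # the last 3 bases of the first sequence are lost
padded, lengths = pad_batch(seqs)              # every base is kept; lengths records the real sizes
```

With the true lengths available, a recurrent model can ignore the padded positions instead of discarding real bases.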

Highlights

Ribonucleic acid (RNA) is a carrier of genetic information, and its structure plays a crucial role in gene maturation, regulation, and function [1,2,3].
The dataset used in this paper comes from the authoritative RNA STRAND database (RNA secondary STRucture and statistical Analysis Database) [32] and includes five subsets: TMR (the tmRNA website [33]), SPR (Sprinzl tRNA Database [34]), SRP (Signal recognition particle database [35]), RFA (the RNA family database [36]), and ASE (RNase P Database [37]). The five subsets contain 2493 sequences in total; the maximum and average lengths are 553 and 267.37, respectively.
Comparison between adaptive LSTM (long short-term memory) with and without the energy-based filter: to prove the validity of the energy-based filter, a comparative experiment was carried out on the five datasets.
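The page does not describe how the energy-based filter works, so the following is only a simplified stand-in for the general idea of discarding implausible predicted base pairs. The per-pair "energies", the `MIN_LOOP` value, and the function name `filter_pairs` are all hypothetical; a real filter would rely on a thermodynamic energy model.

```python
# Hypothetical per-pair "energies" (arbitrary units); illustrative values only,
# not thermodynamic parameters and not the values used in the paper.
TOY_PAIR_ENERGY = {
    frozenset("GC"): -3.0,
    frozenset("AU"): -2.0,
    frozenset("GU"): -1.0,
}
MIN_LOOP = 3  # assumed minimum number of bases enclosed by a pair

def filter_pairs(seq, predicted_pairs, max_energy=0.0):
    """Keep predicted pairs (i, j) that pass the toy energy and distance checks."""
    seq = seq.upper()
    kept, used = [], set()
    for i, j in sorted((min(p), max(p)) for p in predicted_pairs):
        energy = TOY_PAIR_ENERGY.get(frozenset((seq[i], seq[j])))
        if energy is None or energy >= max_energy:
            continue  # non-canonical pair, or not favourable enough
        if j - i - 1 < MIN_LOOP:
            continue  # bases too close together to pair
        if i in used or j in used:
            continue  # each base may pair with at most one partner
        kept.append((i, j))
        used.update((i, j))
    return kept

# Raw network output may contain implausible pairs such as G-A or A-A.
predicted = [(0, 8), (1, 7), (2, 6), (3, 5), (2, 4)]
print(filter_pairs("GGGAAACCC", predicted))
```

In this toy run the G-A pair (2, 4) and the A-A pair (3, 5) are dropped, while the three G-C pairs survive.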

Summary

Introduction

RNA is a carrier of genetic information, and its structure plays a crucial role in gene maturation, regulation, and function [1,2,3]. The secondary structure of an RNA molecule represents base-pair interactions that fundamentally determine overall structure [9,10,11]. RNA secondary structure prediction in the absence of pseudoknots has been studied using dynamic programming algorithms described by Zuker [14] and Mathews [15, 16] and using m-fold [17] and GT-fold [18]. RNA pseudoknotted secondary structure prediction represents an NP-hard optimization problem [19], and RNA secondary structure prediction remains an important issue in structural bioinformatics. Because most natural biological sequences have variable lengths, the sequences have to be truncated before the features are employed by the learning model, which leads to the loss of information and destroys biological-sequence integrity.
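The distinction between pseudoknot-free and pseudoknotted structures can be made concrete with a short sketch. It assumes the common dot-bracket notation, with `[` and `]` marking a second family of pairs, and uses the standard definition that two base pairs (i, j) and (k, l) form a pseudoknot when they cross, that is, i < k < j < l. The function names are illustrative and do not come from the paper.

```python
def parse_dot_bracket(structure):
    """Return sorted base pairs (i, j) from a dot-bracket string using '()' and '[]'."""
    closers = {")": "(", "]": "["}
    stacks = {"(": [], "[": []}
    pairs = []
    for pos, ch in enumerate(structure):
        if ch in stacks:
            stacks[ch].append(pos)
        elif ch in closers:
            stack = stacks[closers[ch]]
            if not stack:
                raise ValueError(f"unbalanced '{ch}' at position {pos}")
            pairs.append((stack.pop(), pos))
        elif ch != ".":
            raise ValueError(f"unexpected character '{ch}' at position {pos}")
    if any(stacks.values()):
        raise ValueError("unclosed bracket")
    return sorted(pairs)

def has_pseudoknot(pairs):
    """True if any two pairs cross, i.e. i < k < j < l."""
    pairs = sorted(pairs)
    for a, (i, j) in enumerate(pairs):
        for k, l in pairs[a + 1:]:
            if i < k < j < l:
                return True
    return False

nested = parse_dot_bracket("((..))")      # properly nested pairs only
knotted = parse_dot_bracket("(([..))]")   # the '[' ']' pair crosses the '(' ')' pairs
print(has_pseudoknot(nested), has_pseudoknot(knotted))
```

Nested structures are the case the dynamic programming methods handle; crossing pairs are what make the general problem hard.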

Methods

Results

Conclusion


Journal: BMC Bioinformatics	Publication Date: Dec 1, 2019
Citations: 21	License type: open-access


