TransUFold: Unlocking the structural complexity of short and long RNA with pseudoknots.

Yunxiang Wang,Hong Zhang,Zhenchao Xu,Shouhua Zhang,Rui Guo

doi:10.3934/mbe.2023854

Yunxiang Wang, Hong Zhang + Show 3 more

Open Access

https://doi.org/10.3934/mbe.2023854

Copy DOI

Abstract

The RNA secondary structure is like a blueprint that holds the key to unlocking the mysteries of RNA function and 3D structure. It serves as a crucial foundation for investigating the complex world of RNA, making it an indispensable component of research in this exciting field. However, pseudoknots cannot be accurately predicted by conventional prediction methods based on free energy minimization, which results in a performance bottleneck. To this end, we propose a deep learning-based method called TransUFold to train directly on RNA data annotated with structure information. It employs an encoder-decoder network architecture, named Vision Transformer, to extract long-range interactions in RNA sequences and utilizes convolutions with lateral connections to supplement short-range interactions. Then, a post-processing program is designed to constrain the model's output to produce realistic and effective RNA secondary structures, including pseudoknots. After training TransUFold on benchmark datasets, we outperform other methods in test data on the same family. Additionally, we achieve better results on longer sequences up to 1600 nt, demonstrating the outstanding performance of Vision Transformer in extracting long-range interactions in RNA sequences. Finally, our analysis indicates that TransUFold produces effective pseudoknot structures in long sequences. As more high-quality RNA structures become available, deep learning-based prediction methods like Vision Transformer can exhibit better performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

TransUFold: Unlocking the structural complexity of short and long RNA with pseudoknots.

Abstract

Talk to us

Similar Papers

More From: Mathematical biosciences and engineering : MBE

Lead the way for us

Journal: Mathematical biosciences and engineering : MBE	Publication Date: Jan 1, 2023
License type: cc-by

Similar Papers

A New Method of RNA Secondary Structure Prediction Based on Convolutional Neural Network and Dynamic Programming.
Hao Zhang ... Zhi Li
Frontiers in Genetics | VOL. 10
Hao Zhang, et. al.Hao Zhang ... Zhi Li
22 May 2019
Frontiers in Genetics | VOL. 10

Prediction of RNA secondary structure based on helical regions distribution.
L Wuju ... W Jiajin
Bioinformatics (Oxford, England) | VOL. 14
L Wuju, et. al.L Wuju ... W Jiajin
01 Jan 1998
Bioinformatics (Oxford, England) | VOL. 14

FASTR: A novel data format for concomitant representation of RNA sequence and secondary structure information.
Tungadri Bose ... Sharmila S Mande
Journal of biosciences | VOL. 40
Tungadri Bose, et. al.Tungadri Bose ... Sharmila S Mande
19 Aug 2015
Journal of biosciences | VOL. 40

Prediction of RNA structures containing pseudoknots
Dongkyu Lee ... Kyungsook Han
Interdisciplinary Bio Central | VOL. 1
Dongkyu Lee, et. al.Dongkyu Lee ... Kyungsook Han
28 Feb 2006
Interdisciplinary Bio Central | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

TransUFold: Unlocking the structural complexity of short and long RNA with pseudoknots.

Abstract

Talk to us

Similar Papers

More From: Mathematical biosciences and engineering : MBE