LazySampling and LinearSampling: fast stochastic sampling of RNA secondary structure with applications to SARS-CoV-2.

He Zhang, David H Mathews, Liang Huang, Liang Zhang, Sizhen Li

doi:10.1093/nar/gkac1029

Abstract

Many RNAs fold into multiple structures at equilibrium, and there is a need to sample these structures according to their probabilities in the ensemble. The conventional sampling algorithm suffers from two limitations: (i) the sampling phase is slow due to many repeated calculations; and (ii) the end-to-end runtime scales cubically with the sequence length. These limitations make the algorithm difficult to apply to long RNAs, such as the full genomes of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). To address these problems, we devise a new sampling algorithm, LazySampling, which eliminates redundant work via on-demand caching. Based on LazySampling, we further derive LinearSampling, an end-to-end linear-time sampling algorithm. In benchmarks on nine diverse RNA families, the structures sampled by LinearSampling correlate better with the well-established secondary structures than those from Vienna RNAsubopt and RNAplfold. More importantly, LinearSampling is orders of magnitude faster than standard tools, being 428× faster (72 s versus 8.6 h) than RNAsubopt on the full genome of SARS-CoV-2 (29 903 nt). The resulting sample landscape correlates well with the experimentally guided secondary structure models, and is closer to the alternative conformations revealed by experimentally driven analysis. Finally, LinearSampling finds 23 regions of 15 nt with high accessibilities in the SARS-CoV-2 genome, which are potential targets for COVID-19 diagnostics and therapeutics.
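To make the idea of on-demand caching during the sampling phase concrete, here is a minimal Python sketch. It is not the authors' implementation: it assumes a toy Nussinov-style model in which every base pair carries the same Boltzmann weight (`PAIR_WEIGHT`), uses a hypothetical minimum hairpin size (`MIN_LOOP`), and omits the pruning that gives LinearSampling its linear runtime. All function names (`make_sampler`, `options`, `accessibility`) are illustrative. The point it shows is that the weighted decomposition choices for a span are built only when sampling first visits that span and are then reused by later samples. It also estimates the accessibility of a window as the fraction of samples in which the window is fully unpaired.

```python
import math
import random
from functools import lru_cache

# Toy model parameters (assumptions for illustration, not from the paper).
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}
MIN_LOOP = 3           # minimum number of unpaired bases enclosed by a pair
PAIR_WEIGHT = math.e   # Boltzmann weight contributed by each base pair


def make_sampler(seq, rng):
    """Return a function that samples dot-bracket structures of a short seq."""
    n = len(seq)

    def terms(i, j):
        # Every way to decompose span [i, j]: base i unpaired, or i paired with k.
        yield q(i + 1, j), None
        for k in range(i + MIN_LOOP + 1, j + 1):
            if (seq[i], seq[k]) in PAIRS:
                yield PAIR_WEIGHT * q(i + 1, k - 1) * q(k + 1, j), k

    @lru_cache(maxsize=None)
    def q(i, j):
        # Toy partition function of seq[i..j] (inclusive); an empty span gives 1.
        return 1.0 if i > j else sum(w for w, _ in terms(i, j))

    @lru_cache(maxsize=None)
    def options(i, j):
        # Built only when sampling first visits (i, j); reused by later samples.
        return tuple(terms(i, j))

    def sample():
        struct = ["."] * n
        stack = [(0, n - 1)]
        while stack:
            i, j = stack.pop()
            if i > j:
                continue
            opts = options(i, j)
            _, k = rng.choices(opts, weights=[w for w, _ in opts])[0]
            if k is None:
                stack.append((i + 1, j))
            else:
                struct[i], struct[k] = "(", ")"
                stack.append((i + 1, k - 1))
                stack.append((k + 1, j))
        return "".join(struct)

    return sample


def accessibility(samples, start, width):
    """Fraction of samples in which positions start..start+width-1 are unpaired."""
    window = "." * width
    return sum(s[start:start + width] == window for s in samples) / len(samples)


rng = random.Random(0)
sample = make_sampler("GGGAAAUCCCAAAGGGUUUCCC", rng)
samples = [sample() for _ in range(1000)]
print(accessibility(samples, 3, 3))
```

In this sketch, a sampler without the `options` cache would rebuild the same weighted choices every time a span is revisited across samples, which is the kind of repeated calculation the abstract describes. The recursion here is only suitable for short sequences.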

Journal: Nucleic Acids Research
Publication Date: Nov 18, 2022
Citations: 7
License type: CC BY-NC 4.0
