Abstract

Abstract Three prevalent SARS-CoV-2 variants of concern (VOCs) emerged and caused epidemic waves. It is essential to uncover advantageous mutations that cause the high transmissibility of VOCs. However, viral mutations are tightly linked, so traditional population genetic methods, including machine learning–based methods, cannot reliably detect mutations conferring a fitness advantage. In this study, we developed an approach based on the sequential occurrence order of mutations and the accelerated furcation rate in the pandemic-scale phylogenomic tree. We analyzed 3,777,753 high-quality SARS-CoV-2 genomic sequences and the epidemiology metadata using the Coronavirus GenBrowser. We found that two noncoding mutations at the same position (g.a28271−/u) may be crucial to the high transmissibility of Alpha, Delta, and Omicron VOCs although the noncoding mutations alone cannot increase viral transmissibility. Both mutations cause an A-to-U change at the core position −3 of the Kozak sequence of the N gene and significantly reduce the protein expression ratio of ORF9b to N. Using a convergent evolutionary analysis, we found that g.a28271−/u, S:p.P681H/R, and N:p.R203K/M occur independently on three VOC lineages, suggesting that coordinated changes of S, N, and ORF9b proteins are crucial to high viral transmissibility. Our results provide new insights into high viral transmissibility co-modulated by advantageous noncoding and nonsynonymous changes.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call