Abstract

The Betacoronaviruses comprise multiple subgenera whose members have been implicated in human disease. As with SARS, MERS and now SARS-CoV-2, the origin and emergence of new variants are often attributed to events of recombination that alter host tropism or disease severity. In most cases, recombination has been detected by searches for excessively similar genomic regions in divergent strains; however, such analyses are complicated by the high mutation rates of RNA viruses, which can produce sequence similarities in distant strains by convergent mutations. By applying a genome-wide approach that examines the source of individual polymorphisms and that can be tested against null models in which recombination is absent and homoplasies can arise only by convergent mutations, we examine the extent and limits of recombination in Betacoronaviruses. We find that recombination accounts for nearly 40% of the polymorphisms circulating in populations and that gene exchange occurs almost exclusively among strains belonging to the same subgenus. Although experimental studies have shown that recombinational exchanges occur at random along the coronaviral genome, in nature, they are vastly overrepresented in regions controlling viral interaction with host cells.

Highlights

  • Issues surrounding the extent and frequency of recombination in the coronaviruses are of recent concern due to the investigations into the origins of SARS-CoV-2, the etiologic agent of COVID-19

  • Frequent mutations will (1) increase the likelihood of convergent mutations, in regions subject to strong positive selection, causing sequence similarities that can be mistaken for recombination events, and (2) introduce new changes that accumulate and obscure recognition of past recombination events

  • SARS-CoV-2 is most similar to SARS-like strains isolated from bats [1,2,3,4], one region of the SARS-CoV-2 genome—the sequence encoding the spike protein that assists in its invasion of host cells—resembles that of a divergent SARS strain isolated from Guangdong pangolins [3]

Read more

Summary

Introduction

Issues surrounding the extent and frequency of recombination in the coronaviruses are of recent concern due to the investigations into the origins of SARS-CoV-2, the etiologic agent of COVID-19. Analyses of SARS-CoV-2 nucleotide sequences showed that this virus has over 95% sequence similarity to SARS-like bat coronavirus RaTG13, hinting at its possible origins [1]. The RBD regions of SARS-CoV-2 and the bat coronavirus RaTG13 share only 90% amino-acid similarity [1] and overlap in only one of the six amino acids identified as critical residues [3]. The greater similarity of the RBD region to a more distantly related strain raises questions about whether the spike protein of SARS-CoV-2 evolved through a recombination event or, alternatively, by multiple independent mutations followed by selection, causing convergence to the more pangolin-derived sequence

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call