Abstract

BackgroundThe SARS-CoV-2 pandemic is one of the greatest global medical and social challenges that have emerged in recent history. Human coronavirus strains discovered during previous SARS outbreaks have been hypothesized to pass from bats to humans using intermediate hosts, e.g. civets for SARS-CoV and camels for MERS-CoV. The discovery of an intermediate host of SARS-CoV-2 and the identification of specific mechanism of its emergence in humans are topics of primary evolutionary importance. In this study we investigate the evolutionary patterns of 11 main genes of SARS-CoV-2. Previous studies suggested that the genome of SARS-CoV-2 is highly similar to the horseshoe bat coronavirus RaTG13 for most of the genes and to some Malayan pangolin coronavirus (CoV) strains for the receptor binding (RB) domain of the spike protein.ResultsWe provide a detailed list of statistically significant horizontal gene transfer and recombination events (both intergenic and intragenic) inferred for each of 11 main genes of the SARS-CoV-2 genome. Our analysis reveals that two continuous regions of genes S and N of SARS-CoV-2 may result from intragenic recombination between RaTG13 and Guangdong (GD) Pangolin CoVs. Statistically significant gene transfer-recombination events between RaTG13 and GD Pangolin CoV have been identified in region [1215–1425] of gene S and region [534–727] of gene N. Moreover, some statistically significant recombination events between the ancestors of SARS-CoV-2, RaTG13, GD Pangolin CoV and bat CoV ZC45-ZXC21 coronaviruses have been identified in genes ORF1ab, S, ORF3a, ORF7a, ORF8 and N. Furthermore, topology-based clustering of gene trees inferred for 25 CoV organisms revealed a three-way evolution of coronavirus genes, with gene phylogenies of ORF1ab, S and N forming the first cluster, gene phylogenies of ORF3a, E, M, ORF6, ORF7a, ORF7b and ORF8 forming the second cluster, and phylogeny of gene ORF10 forming the third cluster.ConclusionsThe results of our horizontal gene transfer and recombination analysis suggest that SARS-CoV-2 could not only be a chimera virus resulting from recombination of the bat RaTG13 and Guangdong pangolin coronaviruses but also a close relative of the bat CoV ZC45 and ZXC21 strains. They also indicate that a GD pangolin may be an intermediate host of this dangerous virus.

Highlights

  • The Severe acute respiratory syndrome (SARS)-CoV-2 pandemic is one of the greatest global medical and social challenges that have emerged in recent history

  • Our findings suggest that the SARS-CoV-2 genome may be formed via recombination of genomes close to the RaTG13 and GD Pangolin CoV genomes, and be a close relative of bat CoV ZC45 and ZXC21

  • Results we present a detailed analysis of putative gene transfer and recombination events that were detected in each of 11 main genes of SARS-CoV-2 as well as in the receptor binding (RB) domain of the spike protein

Read more

Summary

Introduction

The SARS-CoV-2 pandemic is one of the greatest global medical and social challenges that have emerged in recent history. Previous studies suggested that the genome of SARS-CoV-2 is highly similar to the horseshoe bat coronavirus RaTG13 for most of the genes and to some Malayan pangolin coronavirus (CoV) strains for the receptor binding (RB) domain of the spike protein. Five key amino acid residues taking part in the interaction with human angiotensin-converting enzyme 2 (ACE2) are completely identical in SARS-CoV-2 and GD Pangolin CoV These amino acids are different in the RB domain of RaTG13 and the Guangxi (GX) Pangolin CoV (virus found in Malayan pangolins in the Guangxi province of China). According to the third hypothesis, two or more close relatives of SARS-CoV-2 may have been affected by some recombination events within a host species Such a recombination could result in gene exchange between the CoV genomes. These models of reticulate evolution have been widely studied in the literature since the beginning of this century [2, 3, 10, 12, 18, 18, 23, 24, 26, 27, 34, 29, 35, 44,45,46]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call