Abstract

BackgroundCoronavirus (COVID-19) was introduced into society in late 2019 and has now reached over 88 million cases and 1.9 million deaths. The Middle East has a death toll of ~80,000 and over 35000 of these are in Iran, which has over 1.2 million confirmed cases. We expect that Iranian cases caused outbreaks in the neighbouring countries and that variant mapping and phylogenetic analysis can be used to prove this. We also aim to analyse the variants of severe acute respiratory syndrome coronavirus-2 (SARS -CoV-2) to characterise the common genome variants and provide useful data in the global effort to prevent further spread of COVID-19. MethodsThe approach uses bioinformatics approaches including multiple sequence alignment, variant calling and annotation and phylogenetic analysis to identify the genomic variants found in the region. The approach uses 122 samples from the 13 countries of the Middle East sourced from the Global Initiative on Sharing All Influenza Data (GISAID). FindingsWe identified 2200 distinct genome variants including 129 downstream gene variants, 298 frame shift variants, 789 missense variants, 1 start lost, 13 start gained, 1 stop lost, 249 synonymous variants and 720 upstream gene variants. The most common, high impact variants were 10818delTinsG, 2772delCinsC, 14159delCinsC and 2789delAinsA. These high impact variant ultimately results in 36 number of mutations on spike glycoprotein. Variant alignment and phylogenetic tree generation indicates that samples from Iran likely introduced COVID-19 to the rest of the Middle East. InterpretationThe phylogenetic and variant analysis provides unique insight into mutation types in genomes. Initial introduction of COVID-19 was most likely due to Iranian transmission. Some countries show evidence of novel mutations and unique strains. Increased time in small populations is likely to contribute to more unique genomes. This study provides more in depth analysis of the variants affecting in the region than any other study.

Highlights

  • Since the discovery in Wuhan in late 2019, the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV2) virus has spread internationally to 213 countries with confirmed 6,194,533 cases and 376 320 deaths, at time of writing[1]

  • Source of Data- We obtained the data from the Chinese National Genomics Data Center (NGDC)

  • Saudi Arabia samples are all form a distinct group compared to other samples

Read more

Summary

Introduction

Since the discovery in Wuhan in late 2019, the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV2) virus has spread internationally to 213 countries with confirmed 6,194,533 cases and 376 320 deaths, at time of writing[1]. Of these cases, 536 148 are from the Eastern Mediterranean 1. Koyama et al.[4] used 48 genomes to map variants and classify sub-strains from many locations including China, Japan and the USA. The variants were mapped to Wuhan reference genome NC_045512.2 and were aligned using other Wuhan strains. We found variants in 43 of 50 genomes studied

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call