Abstract

Globally South-East Asia reported 40% of SARS-CoV-2 infected cases in the fourth week of April 2021. It continued to show an increase with India accounting for 50% of cases worldwide and 30% of global deaths. Genomic surveillance should continue at a rapid pace because of the continuously evolving nature of the virus. The time period of sample collection from the Global Initiative on Sharing All Influenza Data database was concurrent with the surge in new cases seen in the Indian subcontinent. 7,415 sequences were downloaded from Global Initiative on Sharing All Influenza Data between January and April 2021; out of which 4,411 were high coverage genome sequences and were considered for analysis. Phylogenetic analysis were carried out using Nextstrain. 21A or B.1.617 or delta was the most prevalent lineage in India accounting for 67.7% of the genomes. Next important clades were 20A, 20B and 20I accounting for 23.6%, 11.8% and 12.1% respectively collected between January 2021 and April 2021. The remaining sequences were assigned to clade 20H, 20J, 20D, 20C, 20G,20E,19A and 19B.The spike mutation frequencies of L452R, E484Q and P681R in Indian state of Maharashtra were 62.4%, 66.5% and 61.5% respectively. Two unique N-terminal domain deletion of spike protein were found at position 67 and 68. The phylogenomics of the delta variant or 21A emerged in neighboring Asian countries of Thailand, Bangladesh, Indonesia and Japan. We analyzed the SARS-CoV-2 genomes from India for mutation characterization of the spike glycoprotein and the nucleocapsid protein.

Highlights

  • South-East Asia reported 40% of SARS-CoV-2 infected cases in the fourth week of April 2021

  • We considered the sequences with high coverage for amino acid substitution analysis and phylogenetic clustering, for molecular analysis of sequences from non-Indian origin, high quality genome sequences with full coverage were retrieved from Global Initiative on Sharing All Influenza Data (GISAID)

  • Sequences comprising less than 0.01% were assigned to 20J (n = 1), 20D (n = 5), 20C (n = 1), 20G (n = 2), 20E (EU1) (n = 1), 19A (n = 5), and 19B (n = 4).There were twelve Nextstrain clades from sequences of India namely 21A, 20C, 20D, 20E, 20G, 20I, 20H, 20J, 19A,19B,20A and 20B as accessed on 25th May 2021

Read more

Summary

Introduction

South-East Asia reported 40% of SARS-CoV-2 infected cases in the fourth week of April 2021. It continued to show an increase with India accounting for 50% of cases worldwide and 30% of global deaths. Methodology: 7,415 sequences were downloaded from Global Initiative on Sharing All Influenza Data between January and April 2021; out of which 4,411 were high coverage genome sequences and were considered for analysis. We analyzed the SARS-CoV-2 genomes from India for mutation characterization of the spike glycoprotein and the nucleocapsid protein. The number of weekly new cases globally showed an increasing trend with 5.2 million reported during the third week of April 2021 [2]. The largest increase was reported by the South-East Asian region mainly India

Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call