Characterisation of SARS-CoV-2 clades based on signature SNPs unveils continuous evolution.

Nimisha Ghosh,Nikhil Sharma,Indrajit Saha,Suman Nandi

doi:10.1016/j.ymeth.2021.09.005

Abstract

Since the emergence of SARS-CoV-2 in Wuhan, China more than a year ago, it has spread across the world in a very short span of time. Although, different forms of vaccines are being rolled out for vaccination programs around the globe, the mutation of the virus is still a cause of concern among the research communities. Hence, it is important to study the constantly evolving virus and its strains in order to provide a much more stable form of cure. This fact motivated us to conduct this research where we have initially carried out multiple sequence alignment of 15359 and 3033 global dataset without Indian and the dataset of exclusive Indian SARS-CoV-2 genomes respectively, using MAFFT. Subsequently, phylogenetic analyses are performed using Nextstrain to identify virus clades. Consequently, the virus strains are found to be distributed among 5 major clades or clusters viz. 19A, 19B, 20A, 20B and 20C. Thereafter, mutation points as SNPs are identified in each clade. Henceforth, from each clade top 10 signature SNPs are identified based on their frequency i.e. number of occurrences in the virus genome. As a result, 50 such signature SNPs are individually identified for global dataset without Indian and dataset of exclusive Indian SARS-CoV-2 genomes respectively. Out of each 50 signature SNPs, 39 and 41 unique SNPs are identified among which 25 non-synonymous signature SNPs (out of 39) resulted in 30 amino acid changes in protein while 27 changes in amino acid are identified from 22 non-synonymous signature SNPs (out of 41). These 30 and 27 amino acid changes for the non-synonymous signature SNPs are visualised in their respective protein structure as well. Finally, in order to judge the characteristics of the identified clades, the non-synonymous signature SNPs are considered to evaluate the changes in proteins as biological functions with the sequences using PROVEAN and PolyPhen-2 while I-Mutant 2.0 is used to evaluate their structural stability. As a consequence, for global dataset without Indian sequences, G251V in ORF3a in clade 19A, F308Y and G196V in NSP4 and ORF3a in 19B are the unique amino acid changes which are responsible for defining each clade as they are all deleterious and unstable. Such changes which are common for both global dataset without Indian and dataset of exclusive Indian sequences are R203M in Nucleocapsid for 20B, T85I and Q57H in NSP2 and ORF3a respectively for 20C while for exclusive Indian sequences such unique changes are A97V in RdRp, G339S and G339C in NSP2 in 19A and Q57H in ORF3a in 20A.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Methods	Publication Date: Sep 20, 2021
Citations: 6	License type: NO-CC CODE

R Discovery Prime

R Discovery Prime

Characterisation of SARS-CoV-2 clades based on signature SNPs unveils continuous evolution.

Abstract

Talk to us

Similar Papers

More From: Methods

Lead the way for us

Similar Papers

Additional file 1: Table S1. of The first whole genome and transcriptome of the cinereous vulture reveals adaptation in the gastric and immune defense systems and possible convergent evolution between the Old and New World vultures
...
-
, et. al. ...
01 Jan 2015
01 Jan 2015

Nucleotide Sequence Analysis of S1 Gene among Iranian Avian Infectious Bronchitis Viruses Isolated during 2001-2002.
...
archives of razi institute | VOL. 74
, et. al. ...
01 Mar 2019
Nucleotide Sequence Analysis of S1 Gene among Iranian Avian Infectious Bronchitis Viruses Isolated during 2001-2002.
...

The extent of molecular variation in novel SARS-CoV-2 after the six-month global spread
Ngoc-Niem Bui ... Yu-Tzu Lin
Infection, Genetics and Evolution | VOL. 91
Ngoc-Niem Bui, et. al.Ngoc-Niem Bui ... Yu-Tzu Lin
05 Mar 2021
Infection, Genetics and Evolution | VOL. 91

Comparative sequences of two type 1 dengue virus strains possessing different growth characteristics in vitro.
Hasanuddin Ishak ... Hisashi Funada
Microbiology and Immunology | VOL. 45
Hasanuddin Ishak, et. al.Hasanuddin Ishak ... Hisashi Funada
01 Apr 2001
Microbiology and Immunology | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Characterisation of SARS-CoV-2 clades based on signature SNPs unveils continuous evolution.

Abstract

Talk to us

Similar Papers

More From: Methods