Abstract

BackgroundSevere acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) is an emerging disease with fatal outcomes. In this study, a fundamental knowledge gap question is to be resolved by evaluating the differences in biological and pathogenic aspects of SARS‐CoV‐2 and the changes in SARS‐CoV‐2 in comparison with the two prior major COV epidemics, SARS and Middle East respiratory syndrome (MERS) coronaviruses.MethodsThe genome composition, nucleotide analysis, codon usage indices, relative synonymous codons usage, and effective number of codons (ENc) were analyzed in the four structural genes; Spike (S), Envelope (E), membrane (M), and Nucleocapsid (N) genes, and two of the most important nonstructural genes comprising RNA‐dependent RNA polymerase and main protease (Mpro) of SARS‐CoV‐2, Beta‐CoV from pangolins, bat SARS, MERS, and SARS CoVs.ResultsSARS‐CoV‐2 prefers pyrimidine rich codons to purines. Most high‐frequency codons were ending with A or T, while the low frequency and rare codons were ending with G or C. SARS‐CoV‐2 structural proteins showed 5 to 20 lower ENc values, compared with SARS, bat SARS, and MERS CoVs. This implies higher codon bias and higher gene expression efficiency of SARS‐CoV‐2 structural proteins. SARS‐CoV‐2 encoded the highest number of over‐biased and negatively biased codons. Pangolin Beta‐CoV showed little differences with SARS‐CoV‐2 ENc values, compared with SARS, bat SARS, and MERS CoV.ConclusionExtreme bias and lower ENc values of SARS‐CoV‐2, especially in Spike, Envelope, and Mpro genes, are suggestive for higher gene expression efficiency, compared with SARS, bat SARS, and MERS CoVs.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call