Understanding the origin and evolution of mutations in SARS-CoV-2 variants of concern (VOCs) is a critical area of research. B. Cao, X. Wang, W. Yin, Z. Gao, and B. Xia (mBio 15:e03187-23, 2024, https://doi.org/10.1128/mbio.03187-23) proposed that these mutations originated from bacterial sequences incorporated into the viral genome through stochastic template-switching by the viral RNA-dependent RNA polymerase (RdRp). Their analysis suggested that 62% of the viral mutation fragments (VMFs) in key SARS-CoV-2 proteins were identical to bacterial protein sequences. Given the implications of this finding, we re-examined the methods employed and argue that they resulted in false-positive findings. Specifically, the short query length of VMFs, seven amino acids, leads to spurious matches in large protein databases, as indicated by high BLAST Expect values. Furthermore, we analyzed the nucleotide sequence of VMFs, revealing no unique homology between SARS-CoV-2 and bacterial sequences. Consequently, the evidence does not support the hypothesis that bacterial sequences contribute to the evolution of SARS-CoV-2 VOCs. Instead, the emergence of these variants is more plausibly attributed to factors intrinsic to viral replication and evolution, such as the error-prone nature of RdRp, intrahost diversity, and recombination of related viral sublineages.
Read full abstract