Abstract

This work presents an approach based on fractal scaling analysis to characterize the organization of Covid-19 genome sequences. The method uses a multivariate version of the fractal rescaled range analysis, implemented on a sliding-window scheme, to detect variations of long-range correlations across genome sequence domains. As a preliminary step, the nucleotide sequence is mapped into a numerical sequence by following the Voss rule, which yields a multichannel sequence represented as a binary matrix. Fractal correlations, quantified in terms of the Hurst exponent, depend on the region of the sequence: the Covid-19 genome sequences are predominantly random, with some patches of weak long-range correlations. The analysis shows that regions of randomness are more abundant in the Covid-19 sequences than in the primitive SARS sequence, which suggests that the Covid-19 virus possesses a more diverse genomic structure for replication and infection. When the analysis is restricted to the surface glycoprotein region, the Covid-19 sequence is less random than the SARS sequence, which indicates that the Covid-19 virus can undergo more ordered replications of the spike protein. The Omicron variant exhibits an interesting pattern, sharing some randomness features with both the SARS and the other Covid-19 genome sequences. Overall, the results show that the multivariate rescaled range analysis provides a suitable framework to assess long-term correlations hidden in the internal organization of the Covid-19 genome sequence.
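To make the pipeline described above concrete, the following is a minimal Python sketch of the three steps: Voss mapping, a multichannel rescaled range (R/S) estimate of the Hurst exponent, and a sliding window over the sequence. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not define the multivariate R/S statistic, so the sketch combines per-channel ranges and variances in quadrature, and the function names, window length, step, and scales are all hypothetical choices.

```python
import numpy as np

def voss_map(seq, alphabet="ACGT"):
    """Voss rule: one binary indicator channel per nucleotide (4 x N matrix)."""
    seq = seq.upper()
    return np.array([[float(s == b) for s in seq] for b in alphabet])

def rescaled_range(block):
    """R/S of a (channels x n) block.

    Assumption: channels are combined in quadrature; the paper's exact
    multivariate definition may differ.
    """
    dev = block - block.mean(axis=1, keepdims=True)
    walk = np.cumsum(dev, axis=1)  # cumulative deviation profile per channel
    r = np.sqrt(np.sum((walk.max(axis=1) - walk.min(axis=1)) ** 2))
    s = np.sqrt(np.sum(block.var(axis=1)))
    return r / s if s > 0 else np.nan

def hurst_rs(x, scales=(8, 16, 32, 64)):
    """Hurst exponent as the slope of log(R/S) versus log(scale)."""
    n = x.shape[1]
    log_s, log_rs = [], []
    for s in scales:
        # Non-overlapping blocks of length s; skip degenerate (constant) blocks.
        vals = [rescaled_range(x[:, i:i + s]) for i in range(0, n - s + 1, s)]
        vals = [v for v in vals if np.isfinite(v)]
        if vals:
            log_s.append(np.log(s))
            log_rs.append(np.log(np.mean(vals)))
    slope, _ = np.polyfit(log_s, log_rs, 1)
    return slope

def sliding_hurst(seq, window=512, step=128, scales=(8, 16, 32, 64)):
    """Hurst exponent for each window position along the sequence."""
    x = voss_map(seq)
    n = x.shape[1]
    starts = range(0, n - window + 1, step)
    return np.array([hurst_rs(x[:, a:a + window], scales) for a in starts])

if __name__ == "__main__":
    # Synthetic random sequence, used only to exercise the code.
    rng = np.random.default_rng(0)
    seq = "".join(rng.choice(list("ACGT"), size=2000))
    print(sliding_hurst(seq))
```

For an uncorrelated sequence the estimates should sit near 0.5, although R/S is known to be biased upward at short scales, so values somewhat above 0.5 do not by themselves indicate long-range correlation. Windows whose exponent departs clearly from the random baseline would correspond to the patches of weak long-range correlation mentioned above.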
