Abstract

SummaryEstimating viral timescales is fundamental in understanding the evolutionary biology of viruses. Molecular clocks are widely used to reveal the recent evolutionary histories of viruses but may severely underestimate their longer-term origins because of the inverse correlation between inferred rates of evolution and the timescale of their measurement. Here, we provide a predictive mechanistic model that readily explains the rate decay phenomenon over a wide range of timescales and recapitulates the ubiquitous power-law rate decay with a slope of −0.65. We show that standard substitution models fail to correctly estimate divergence times once the most rapidly evolving sites saturate, typically after hundreds of years in RNA viruses and thousands of years in DNA viruses. Our model successfully recreates the observed pattern of decay and explains the evolutionary processes behind the time-dependent rate phenomenon. We then apply our model to re-estimate the date of diversification of genotypes of hepatitis C virus to 423,000 (95% highest posterior density [HPD]: 394,000–454,000) years before present, a time preceding the dispersal of modern humans out of Africa, and show that the most recent common ancestor of sarbecoviruses dates back to 21,000 (95% HPD: 19,000–22,000) years ago, nearly thirty times older than previous estimates. This creates a new perspective for our understanding of the origins of these viruses and also suggests that a substantial revision of evolutionary timescales of other viruses can be similarly achieved.

Highlights

  • The timescale over which viruses evolve and how this process is connected to host adaptation has been an area of considerable research and methodological progress in recent decades

  • Inferred short-term rates of virus sequence change should create completely unrecognizable genome sequences if they were naively extrapolated over thousands, or even hundreds, of years, yet endogenous viral elements (EVEs) that integrated into host genomes throughout mammalian evolution are recognizably similar to contemporary genera and families of Bornaviridae, Parvoviridae, and Circoviridae, among many other examples.[13,14,15]

  • This observation is complemented by evidence from studies of virus/host co-evolution[16,17,18] and, more recently, from analyses of viruses recovered from ancient DNA and RNA in archaeological remains,[19,20,21,22] which indicate a remarkable degree of conservation in viral genome sequences and their interrelationships at genus and family levels

Read more

Summary

Introduction

The timescale over which viruses evolve and how this process is connected to host adaptation has been an area of considerable research and methodological progress in recent decades. Inferred short-term rates of virus sequence change should create completely unrecognizable genome sequences if they were naively extrapolated over thousands, or even hundreds, of years, yet endogenous viral elements (EVEs) that integrated into host genomes throughout mammalian evolution are recognizably similar to contemporary genera and families of Bornaviridae, Parvoviridae, and Circoviridae, among many other examples.[13,14,15] This observation is complemented by evidence from studies of virus/host co-evolution[16,17,18] and, more recently, from analyses of viruses recovered from ancient DNA and RNA in archaeological remains,[19,20,21,22] which indicate a remarkable degree of conservation in viral genome sequences and their interrelationships at genus and family levels This dichotomy has been attributed to the time-dependent rate phenomenon (TDRP), which is the observation that apparent rates of evolution are dependent on timescales of measurement.[23,24]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call