Abstract

Herpesviruses (HVs, Family: Herpesviridae) have large genomes that encode hundreds of proteins. Apart from amino acid mutations, protein domain acquisitions, duplications and losses are also common modes of evolution. HV domain repertoires differ across species, and only a core set is shared among all species, aspect that raises a question: How have HV domain repertoires diverged while keeping some similarities? To answer such question, we used profile Hidden Markov Models (HMMs) to search for domains in all possible translated open reading frames (ORFs) of fully sequenced HV genomes. With at least 274 domains being identified, we built a matrix of domain counts per species, and applied a parsimony method to reconstruct the ancestral states of these domains along the HV phylogeny. It revealed events of domain gain, duplication, and loss over more than 400 millions of years, where Alpha-, Beta-, and GammaHVs expanded and condensed their domain repertoires at distinct rates. Most of the acquired domains perform ‘Modulation and Control’, ‘Envelope’, or ‘Auxiliary’ functions, categories that showed high flexibility (number of domains) and redundancy (number of copies). Conversely, few gains and duplications were observed for domains involved in ‘Capsid assembly and structure’, and ‘DNA Replication, recombination and metabolism’. Among the forty-one primordial domains encoded by Herpesviridae ancestors, twenty-eight are still found in all present-day HVs. Because of their distinct evolutionary strategies, HV domain repertoires are very specific at the subfamily, genus and species levels. Differences in domain composition may not only explain HV host range and tissue tropism, but also provide hints to the origins of HVs.

Highlights

  • HV domain repertoires differ across species, and only a core set is shared among all species, aspect that raises a question: How have HV domain repertoires diverged while keeping some similarities? To answer such question, we used profile Hidden Markov Models (HMMs) to search for domains in all possible translated open reading frames (ORFs) of fully sequenced HV genomes

  • To circumvent the limitations of simple sequence comparisons, which fail at identifying homology between distantly related proteins (Mocarski 2007), in this study we applied HMM profiles generated from Pfam alignments to identify domains encoded by fully sequenced herpesviral genomes

  • Since all possible ORFs were scanned for domains disregarding their sizes, initial codons, or whether or not they were annotated in the original genome annotations, our approach detected a large set of at least 274 domains encoded by members of the Herpesviridae family

Read more

Summary

Introduction

Viruses are known by their unorthodox modes of evolution and outstanding strategies of immune evasion and cell infection Their biological origins have long been debated, with many theories being proposed, without a consensus (Wessner 2010; Nasir and Caetano-Anolles 2015; Harish et al 2016), probably because there is no single answer for such complex question (Wessner 2010). When it comes to large DNA viruses, such as members of the family Herpesviridae (hereinafter referred as ‘herpesviruses’ or ‘HVs’), answering this question becomes even harder, specially taking into consideration their genomic complexity.

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call