Abstract

BackgroundThe genetic diversity observed among bacteriophages remains a major obstacle for the identification of homologs and the comparison of their functional modules. In the structural module, although several classes of homologous proteins contributing to the head and tail structure can be detected, proteins of the head-to-tail connection (or neck) are generally more divergent. Yet, molecular analyses of a few tailed phages belonging to different morphological classes suggested that only a limited number of structural solutions are used in order to produce a functional virion. To challenge this hypothesis and analyze proteins diversity at the virion neck, we developed a specific computational strategy to cope with sequence divergence in phage proteins. We searched for homologs of a set of proteins encoded in the structural module using a phage learning database.ResultsWe show that using a combination of iterative profile-profile comparison and gene context analyses, we can identify a set of head, neck and tail proteins in most tailed bacteriophages of our database. Classification of phages based on neck protein sequences delineates 4 Types corresponding to known morphological subfamilies. Further analysis of the most abundant Type 1 yields 10 Clusters characterized by consistent sets of head, neck and tail proteins. We developed Virfam, a webserver that automatically identifies proteins of the phage head-neck-tail module and assign phages to the most closely related cluster of phages. This server was tested against 624 new phages from the NCBI database. 93% of the tailed and unclassified phages could be assigned to our head-neck-tail based categories, thus highlighting the large representativeness of the identified virion architectures. Types and Clusters delineate consistent subgroups of Caudovirales, which correlate with several virion properties.ConclusionsOur method and webserver have the capacity to automatically classify most tailed phages, detect their structural module, assign a function to a set of their head, neck and tail genes, provide their morphologic subtype and localize these phages within a “head-neck-tail” based classification. It should enable analysis of large sets of phage genomes. In particular, it should contribute to the classification of the abundant unknown viruses found on assembled contigs of metagenomic samples.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2164-15-1027) contains supplementary material, which is available to authorized users.

Highlights

  • The genetic diversity observed among bacteriophages remains a major obstacle for the identification of homologs and the comparison of their functional modules

  • To obtain a global view of the “structural module” in bacteriophages and probe whether newly sequenced phages can be assigned to already known systems, we developed a specific computational strategy able to cope with the high divergence and plasticity of phage genomes

  • We first searched for homologs of 5 head and tail proteins using the homology detection tool HHsearch [32], which relies on the comparison between pairs of multiple sequence alignments. This tool can reveal more remote homologies between proteins than the profile-sequence comparison tool PSI-Blast [31]. It was used with a high confidence probability threshold of 90% to search for the portal, large terminase subunit and major capsid protein (MCP) on the head side, the major tail (MTP) and sheath proteins on the tail side, among a set of 447 complete phage genomes, including 303 tailed phages (Caudovirales), 119 non tailed phages and 25 uncharacterised phages

Read more

Summary

Introduction

The genetic diversity observed among bacteriophages remains a major obstacle for the identification of homologs and the comparison of their functional modules. Molecular analyses of a few tailed phages belonging to different morphological classes suggested that only a limited number of structural solutions are used in order to produce a functional virion. To challenge this hypothesis and analyze proteins diversity at the virion neck, we developed a specific computational strategy to cope with sequence divergence in phage proteins. Phenotypic observation does not permit finer grained structural distinction among these tailed phages, so that molecular tools are needed to further classify them [7] At this family level, some classification problems arise, as some phages can be separated into Sipho-, Myo- and Podoviridae, while having closely related genomes [8,9]

Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.