Abstract

BackgroundWe introduce several forest-based and network-based methods for exploring microbial evolution, and apply them to the study of thousands of genes from 30 strains of E. coli. This case study illustrates how additional analyses could offer fast heuristic alternatives to standard tree of life (TOL) approaches.ResultsWe use gene networks to identify genes with atypical modes of evolution, and genome networks to characterize the evolution of genetic partnerships between E. coli and mobile genetic elements. We develop a novel polychromatic quartet method to capture patterns of recombination within E. coli, to update the clanistic toolkit, and to search for the impact of lateral gene transfer and of pathogenicity on gene evolution in two large forests of trees bearing E. coli. We unravel high rates of lateral gene transfer involving E. coli (about 40% of the trees under study), and show that both core genes and shell genes of E. coli are affected by non-tree-like evolutionary processes. We show that pathogenic lifestyle impacted the structure of 30% of the gene trees, and that pathogenic strains are more likely to transfer genes with one another than with non-pathogenic strains. In addition, we propose five groups of genes as candidate mobile modules of pathogenicity. We also present strong evidence for recent lateral gene transfer between E. coli and mobile genetic elements.ConclusionsDepending on which evolutionary questions biologists want to address (i.e. the identification of modules, genetic partnerships, recombination, lateral gene transfer, or genes with atypical evolutionary modes, etc.), forest-based and network-based methods are preferable to the reconstruction of a single tree, because they provide insights and produce hypotheses about the dynamics of genome evolution, rather than the relative branching order of species and lineages. Such a methodological pluralism - the use of woods and webs - is to be encouraged to analyse the evolutionary processes at play in microbial evolution.This manuscript was reviewed by: Ford Doolittle, Tal Pupko, Richard Burian, James McInerney, Didier Raoult, and Yan Boucher

Highlights

  • We introduce several forest-based and network-based methods for exploring microbial evolution, and apply them to the study of thousands of genes from 30 strains of E. coli

  • Our results indicate that an alternative approach to the tree of life (TOL) such as clanistics can sort out a forest of trees and make predictions regarding the possible implication of some gene families in pathogenicity and even specific diseases

  • Our present goal was certainly not to offer a new detailed picture of E. coli evolution, even though we could confirm many well known facts about the prevalence of recombination and lateral gene transfer (LGT) in E. coli [25,45], and support some new hypotheses

Read more

Summary

Introduction

We introduce several forest-based and network-based methods for exploring microbial evolution, and apply them to the study of thousands of genes from 30 strains of E. coli. The reconstruction of the tree of life (TOL) was an important goal of evolutionary science [1]. This inclusive hierarchical classification, through its genealogical structure, was expected to reflect the relative branching order of all biological lineages, as they diverged from a last common ancestor. This unique, universal, natural, and genealogical pattern was . Multiple genes, were used to propose a well resolved TOL, either by the concatenation of markers, by averaging their phylogenetic signal, or by a corroboration of their individual phylogenetic signals in congruence analyses that sought a hierarchical pattern shared by most of these genes [2,4,5]

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call