Abstract

The evolutionary history of all life forms is usually represented as a vertical tree-like process. In prokaryotes, however, the vertical signal is partly obscured by the massive influence of horizontal gene transfer (HGT). The HGT creates widespread discordance between evolutionary histories of different genes as genomes become mosaics of gene histories. Thus, the Tree of Life (TOL) has been questioned as an appropriate representation of the evolution of prokaryotes. Nevertheless a common hypothesis is that prokaryotic evolution is primarily tree-like, and a routine effort is made to place new isolates in their appropriate location in the TOL. Moreover, it appears desirable to exploit non–tree-like evolutionary processes for the task of microbial classification. In this work, we present a novel technique that builds on the straightforward observation that gene order conservation (‘synteny’) decreases in time as a result of gene mobility. This is particularly true in prokaryotes, mainly due to HGT. Using a ‘synteny index’ (SI) that measures the average synteny between a pair of genomes, we developed the phylogenetic reconstruction tool ‘Phylo SI’. Phylo SI offers several attractive properties such as easy bootstrapping, high sensitivity in cases where phylogenetic signal is weak and computational efficiency. Phylo SI was tested both on simulated data and on two bacterial data sets and compared with two well-established phylogenetic methods. Phylo SI is particularly efficient on short evolutionary distances where synteny footprints remain detectable, whereas the nucleotide substitution signal is too weak for reliable sequence-based phylogenetic reconstruction. The method is publicly available at http://research.haifa.ac.il/ssagi/software/PhyloSI.zip.

Highlights

  • The ever decreasing sequencing costs, and improvements in assembly algorithms and automated annotation have resulted in thousands of bacterial genomes that are being sequenced each year

  • As the maximum likelihood (ML) reconstruction is considered more reliable than distance based, we chose to use the aligned sequences and in the Supplementary Material we provide results from the NJ tree from Ribosomal Database Project (RDP)

  • Our basic assumption is that the probability of a horizontal gene transfer (HGT) at a gene grows as a function of time [34], and a gene at two genomes will have a greater probability to reside at different k-neighborhood, the longer the time between divergence of the two genomes

Read more

Summary

Introduction

The ever decreasing sequencing costs, and improvements in assembly algorithms and automated annotation have resulted in thousands of bacterial genomes that are being sequenced each year. Ribosomal RNA genes often do not provide sufficient phylogenetic resolution or show intragenomic variability, and whole-genome data should ideally be used, but deriving taxonomy from these data can be difficult. Selecting multiple house-keeping genes and concatenating them, aligning them with similar concatenations from other bacteria and reconstructing phylogenies requires substantial knowhow, and still has significant drawbacks and cannot always fully resolve the position of the new taxon on the tree [1,2]. One would harness the whole genomic information and increase the phylogenetic signal, but genomes of bacteria and archaea are characterized by numerous horizontal gene transfer (HGT) [3,4,5,6]. Different genes in the genome can have different evolutionary histories and conflicting phylogenetic signals, so lumping them together could result in inaccurate organismal phylogenies

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.