Abstract

The multispecies coalescent (MSC) is a statistical framework that models how gene genealogies grow within the branches of a species tree. The field of computational phylogenetics has witnessed an explosion in the development of methods for species tree inference under MSC, owing mainly to the accumulating evidence of incomplete lineage sorting in phylogenomic analyses. However, the evolutionary history of a set of genomes, or species, could be reticulate due to the occurrence of evolutionary processes such as hybridization or horizontal gene transfer. We report on a novel method for Bayesian inference of genome and species phylogenies under the multispecies network coalescent (MSNC). This framework models gene evolution within the branches of a phylogenetic network, thus incorporating reticulate evolutionary processes, such as hybridization, in addition to incomplete lineage sorting. As phylogenetic networks with different numbers of reticulation events correspond to points of different dimensions in the space of models, we devise a reversible-jump Markov chain Monte Carlo (RJMCMC) technique for sampling the posterior distribution of phylogenetic networks under MSNC. We implemented the methods in the publicly available, open-source software package PhyloNet and studied their performance on simulated and biological data. The work extends the reach of Bayesian inference to phylogenetic networks and enables new evolutionary analyses that account for reticulation.

Highlights

  • Species trees capture how species evolved and diverged from a common ancestor

  • The field of computational phylogenetics has witnessed an explosion in the development of methods for species tree inference under multispecies coalescent (MSC), owing mainly to the accumulating evidence of incomplete lineage sorting in phylogenomic analyses

  • The last two decades have witnessed a great flourish in the development and implementation of phylogenetic methods based on the multispecies coalescent model to capture the intricate relationship between gene and genome evolution

Read more

Summary

Introduction

Species trees capture how species evolved and diverged from a common ancestor These trees provide a framework for understanding how genes, genomes, and traits evolve [1, 2]. With the availability of data from multiple genomic regions, and often whole genomes, modern inference techniques utilize all these data and employ the multispecies coalescent (MSC) model [5]. This model captures how gene (more generally, non-recombining genomic regions) genealogies grow within the branches of a species tree when extending the coalescent model [6] to multiple populations tied together by a phylogenetic tree (Fig 1A). A wide array of methods have been devised for inferring species trees under the MSC model either directly from the sequence data [7,8,9] or from gene tree estimates [10, 11]; see [12,13,14] for recent reviews of species phylogeny inference methods

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call