Bayesian inference of phylogenetic networks from bi-allelic genetic markers.

Jiafan Zhu, Yun Yu, Dingqiao Wen, Heidi M Meudt, Luay Nakhleh

doi:10.1371/journal.pcbi.1005932

Open Access

Abstract

Phylogenetic networks are rooted, directed, acyclic graphs that model reticulate evolutionary histories. Recently, statistical methods were devised for inferring such networks from either gene tree estimates or the sequence alignments of multiple unlinked loci. Bi-allelic markers, most notably single nucleotide polymorphisms (SNPs) and amplified fragment length polymorphisms (AFLPs), provide a powerful source of genome-wide data. In a recent paper, a method called SNAPP was introduced for statistical inference of species trees from unlinked bi-allelic markers. The generative process assumed by the method combined a model of evolution for the bi-allelic markers with the multispecies coalescent. A novel component of the method was a polynomial-time algorithm for exact computation of the likelihood of a fixed species tree via integration over all possible gene trees for a given marker. Here we report on a method for Bayesian inference of phylogenetic networks from bi-allelic markers. Our method significantly extends the algorithm for exact computation of phylogenetic network likelihood via integration over all possible gene trees. Unlike the case of species trees, the algorithm is no longer polynomial-time on all instances of phylogenetic networks. Furthermore, the method utilizes a reversible-jump MCMC technique to sample the posterior of phylogenetic networks given bi-allelic marker data. Our method performs well in terms of accuracy and robustness, as we demonstrate on simulated data as well as on a data set of multiple New Zealand species of the plant genus Ourisia (Plantaginaceae). We implemented the method in the publicly available, open-source PhyloNet software package.
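
To make "a model of evolution for the bi-allelic markers" concrete, the following is a minimal sketch of a two-state continuous-time Markov chain, a standard way to model a marker that switches between two alleles along a branch. The rate names `u` (allele 0 to allele 1) and `v` (allele 1 to allele 0) and the function name are assumptions chosen for illustration. They are not taken from the abstract or from the PhyloNet code base.

```python
import math


def two_state_transition_matrix(u, v, t):
    """Transition probabilities of a two-state Markov chain after time t.

    u: assumed mutation rate from allele 0 to allele 1
    v: assumed mutation rate from allele 1 to allele 0
    t: elapsed time (branch length), in the same units as 1/rate

    Returns a 2x2 nested list P where P[a][b] is the probability that a
    lineage carrying allele a at time 0 carries allele b at time t.
    """
    if u <= 0 or v <= 0:
        raise ValueError("mutation rates must be positive")
    if t < 0:
        raise ValueError("time must be non-negative")

    total = u + v
    pi1 = u / total  # stationary frequency of allele 1
    pi0 = v / total  # stationary frequency of allele 0
    decay = math.exp(-total * t)

    p01 = pi1 * (1.0 - decay)  # 0 -> 1
    p10 = pi0 * (1.0 - decay)  # 1 -> 0
    return [[1.0 - p01, p01], [p10, 1.0 - p10]]
```

At `t = 0` the matrix is the identity, and as `t` grows each row approaches the stationary frequencies `(v / (u + v), u / (u + v))`.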

Highlights

The availability of genome-wide data from many species and, in some cases, many individuals per species, has transformed the study of evolutionary histories, and given rise to phylogenomics: the inference of gene and species evolutionary histories from genome-wide data
Consider a data set S = {S1, ..., Sm} consisting of the molecular sequences of m loci under the assumptions of free recombination between loci and no recombination within a locus
We introduce a novel algorithm for computing the likelihood of phylogenetic networks from bi-allelic genetic markers and use it in a Bayesian inference method
These networks and parameters were inspired by the phylogenetic networks inferred from an empirical genomic data set in [21, 22]

Summary

Introduction

Consider a data set $S = \{S_1, \ldots, S_m\}$ consisting of the molecular sequences of $m$ loci under the assumptions of free recombination between loci and no recombination within a locus. The likelihood of a species phylogeny $\Psi$ (topology and parameters) is given by

$$L(\Psi \mid S) = \prod_{i=1}^{m} p(S_i \mid \Psi) = \prod_{i=1}^{m} \int_{G} p(S_i \mid g)\, p(g \mid \Psi)\, dg,$$

where the integration is over all possible gene trees $g$. The term $p(S_i \mid g)$ is the likelihood of gene tree $g$ given the sequence data of locus $i$ [1]. The term $p(g \mid \Psi)$ is the density function (pdf) of gene trees given the species phylogeny and its parameters. Rannala and Yang [2] derived this pdf under the multispecies coalescent (MSC). This formulation underlies the Bayesian inference methods of [2,3,4].
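
The structure of this likelihood, a product over loci of a marginalization over gene trees, can be sketched in a few lines. This toy version is not the paper's algorithm: it replaces the integral over all gene trees with a sum over a small, finite list of candidate gene trees, and it takes the per-tree quantities as precomputed log values. All function and parameter names are hypothetical.

```python
import math


def logsumexp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))


def locus_log_likelihood(log_p_seq_given_tree, log_p_tree_given_phylogeny):
    """log of sum_g p(S_i | g) * p(g | phylogeny) for one locus.

    Both arguments are lists indexed by the same finite set of candidate
    gene trees (a stand-in for the integral over all gene trees).
    """
    if len(log_p_seq_given_tree) != len(log_p_tree_given_phylogeny):
        raise ValueError("both lists must cover the same gene trees")
    joint = [a + b for a, b in
             zip(log_p_seq_given_tree, log_p_tree_given_phylogeny)]
    return logsumexp(joint)


def total_log_likelihood(loci, log_p_tree_given_phylogeny):
    """Sum of per-locus log-likelihoods, i.e. the log of the product over loci.

    loci: one list of log p(S_i | g) values per locus.
    """
    return sum(locus_log_likelihood(locus, log_p_tree_given_phylogeny)
               for locus in loci)


# Toy usage: two candidate gene trees, two loci (made-up numbers).
log_prior = [math.log(0.7), math.log(0.3)]
loci = [
    [math.log(0.2), math.log(0.1)],
    [math.log(0.05), math.log(0.4)],
]
log_lik = total_log_likelihood(loci, log_prior)
```

Working in log space keeps the product over many loci from underflowing, which matters once the number of loci is in the thousands.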

Journal: PLOS Computational Biology
Publication Date: Jan 10, 2018
Citations: 51
License type: CC BY 4.0
