Likelihood Inference of Non-Constant Diversification Rates with Incomplete Taxon Sampling

Sebastian Höhna

doi:10.1371/journal.pone.0084184

Abstract

Large-scale phylogenies provide a valuable source to study background diversification rates and investigate if the rates have changed over time. Unfortunately most large-scale, dated phylogenies are sparsely sampled (fewer than 5% of the described species) and taxon sampling is not uniform. Instead, taxa are frequently sampled to obtain at least one representative per subgroup (e.g. family) and thus to maximize diversity (diversified sampling). So far, such complications have been ignored, potentially biasing the conclusions that have been reached. In this study I derive the likelihood of a birth-death process with non-constant (time-dependent) diversification rates and diversified taxon sampling. Using simulations I test if the true parameters and the sampling method can be recovered when the trees are small or medium sized (fewer than 200 taxa). The results show that the diversification rates can be inferred and the estimates are unbiased for large trees but are biased for small trees (fewer than 50 taxa). Furthermore, model selection by means of Akaike's Information Criterion favors the true model if the true rates differ sufficiently from alternative models (e.g. the birth-death model is recovered if the extinction rate is large and compared to a pure-birth model). Finally, I applied six different diversification rate models – ranging from a constant-rate pure birth process to a decreasing speciation rate birth-death process but excluding any rate shift models – on three large-scale empirical phylogenies (ants, mammals and snakes with respectively 149, 164 and 41 sampled species). All three phylogenies were constructed by diversified taxon sampling, as stated by the authors. However only the snake phylogeny supported diversified taxon sampling. Moreover, a parametric bootstrap test revealed that none of the tested models provided a good fit to the observed data. The model assumptions, such as homogeneous rates across species or no rate shifts, appear to be violated.

Highlights

Patterns of biodiversity reflected in phylogenetic estimates indicate that (1) rates of diversification are not constant over time or across the tree and (2) taxonomic sampling is both incomplete and non-random
It is well known how to accommodate uniform taxon sampling, where every taxon has the same probability to be included in the dataset, in inference based on the birth-death process [6,10]
Even under a constant-rate pure birth process the Maximum Likelihood Estimation (MLE) was biased for trees with fewer than 50 taxa compared with the results of Morlon et al who found no bias, see Figure S4 in [8]

Summary

Introduction

Patterns of biodiversity reflected in phylogenetic estimates indicate that (1) rates of diversification are not constant over time or across the tree and (2) taxonomic sampling is both incomplete and non-random. Taxa are often selected so that the diversity is maximized, e.g. sampling at least one species per family [4,5]. This strategy is called diversified sampling [2]. The birth-death process with uniform taxon sampling has been extended to time-dependent rates [8] and diversity-dependent rates [11]. Diversified taxon sampling has only been considered in the context of constant rates [2] and, to my knowledge, the corresponding likelihood functions for non-constant rates have not been available previously

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLoS ONE	Publication Date: Jan 6, 2014
Citations: 52	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Likelihood Inference of Non-Constant Diversification Rates with Incomplete Taxon Sampling

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE

Lead the way for us

Similar Papers

Impacts of Taxon-Sampling Schemes on Bayesian Tip Dating Under the Fossilized Birth-Death Process.
Arong Luo ... Simon Y W Ho
Systematic Biology | VOL. 72
Arong Luo, et. al.Arong Luo ... Simon Y W Ho
15 Mar 2023
Systematic Biology | VOL. 72

Skyline Fossilized Birth-Death Model is Robust to Violations of Sampling Assumptions in Total-Evidence Dating.
Chi Zhang ... Fredrik Ronquist
Systematic Biology | VOL. 72
Chi Zhang, et. al.Chi Zhang ... Fredrik Ronquist
22 Aug 2023
Systematic Biology | VOL. 72

Total-Evidence Dating under the Fossilized Birth-Death Process.
Chi Zhang ... Tanja Stadler
Systematic Biology | VOL. 65
Chi Zhang, et. al.Chi Zhang ... Tanja Stadler
22 Oct 2015
Systematic Biology | VOL. 65

Inferring Speciation and Extinction Rates under Different Sampling Schemes
S Hohna ... T Britton
Molecular Biology and Evolution | VOL. 28
S Hohna, et. al.S Hohna ... T Britton
11 Apr 2011
Molecular Biology and Evolution | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Likelihood Inference of Non-Constant Diversification Rates with Incomplete Taxon Sampling

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE