Assessment of Substitution Model Adequacy Using Frequentist and Bayesian Methods

Jennifer Ripplinger,Jack Sullivan

doi:10.1093/molbev/msq168

Abstract

In order to have confidence in model-based phylogenetic methods, such as maximum likelihood (ML) and Bayesian analyses, one must use an appropriate model of molecular evolution identified using statistically rigorous criteria. Although model selection methods such as the likelihood ratio test and Akaike information criterion are widely used in the phylogenetic literature, model selection methods lack the ability to reject all models if they provide an inadequate fit to the data. There are two methods, however, that assess absolute model adequacy, the frequentist Goldman-Cox (GC) test and Bayesian posterior predictive simulations (PPSs), which are commonly used in conjunction with the multinomial log likelihood test statistic. In this study, we use empirical and simulated data to evaluate the adequacy of common substitution models using both frequentist and Bayesian methods and compare the results with those obtained with model selection methods. In addition, we investigate the relationship between model adequacy and performance in ML and Bayesian analyses in terms of topology, branch lengths, and bipartition support. We show that tests of model adequacy based on the multinomial likelihood often fail to reject simple substitution models, especially when the models incorporate among-site rate variation (ASRV), and normally fail to reject less complex models than those chosen by model selection methods. In addition, we find that PPSs often fail to reject simpler models than the GC test. Use of the simplest substitution models not rejected based on fit normally results in similar but divergent estimates of tree topology and branch lengths. In addition, use of the simplest adequate substitution models can affect estimates of bipartition support, although these differences are often small with the largest differences confined to poorly supported nodes. We also find that alternative assumptions about ASRV can affect tree topology, tree length, and bipartition support. Our results suggest that using the simplest substitution models not rejected based on fit may be a valid alternative to implementing more complex models identified by model selection methods. However, all common substitution models may fail to recover the correct topology and assign appropriate bipartition support if the true tree shape is difficult to estimate regardless of model adequacy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Assessment of Substitution Model Adequacy Using Frequentist and Bayesian Methods

Abstract

Talk to us

Similar Papers

More From: Molecular Biology and Evolution

Lead the way for us

Journal: Molecular Biology and Evolution	Publication Date: Jul 8, 2010
Citations: 66

Similar Papers

Exploring Among-Site Rate Variation Models in a Maximum Likelihood Framework Using Empirical Data: Effects of Model Assumptions on Estimates of Topology, Branch Lengths, and Bootstrap Support
Thomas R Buckley ... Chris Simon
Systematic Biology | VOL. 50
Thomas R Buckley, et. al.Thomas R Buckley ... Chris Simon
01 Feb 2001
Systematic Biology | VOL. 50

Does Choice in Model Selection Affect Maximum Likelihood Analysis?
Jennifer Ripplinger ... Jack Sullivan
Systematic Biology | VOL. 57
Jennifer Ripplinger, et. al.Jennifer Ripplinger ... Jack Sullivan
01 Feb 2008
Systematic Biology | VOL. 57

Nucleotide Substitution Model Selection Is Not Necessary for Bayesian Inference of Phylogeny With Well-Behaved Priors.
Luiza Guimarães Fabreti ... Sebastian Höhna
Systematic Biology | VOL. 72
Luiza Guimarães Fabreti, et. al.Luiza Guimarães Fabreti ... Sebastian Höhna
17 Jul 2023
Systematic Biology | VOL. 72

When Trees Grow Too Long: Investigating the Causes of Highly Inaccurate Bayesian Branch-Length Estimates
Jeremy M Brown ... Alan R Lemmon
Systematic Biology | VOL. 59
Jeremy M Brown, et. al.Jeremy M Brown ... Alan R Lemmon
10 Dec 2009
Systematic Biology | VOL. 59

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Assessment of Substitution Model Adequacy Using Frequentist and Bayesian Methods

Abstract

Talk to us

Similar Papers

More From: Molecular Biology and Evolution