Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data.

Mark N Puttick,Alastair R Tanner,James E Tarver,Luke A Parry,Jesus Lozano-Fernandez,Philip C J Donoghue,Joseph E O'Reilly,Lucy Holloway,Davide Pisani,James F Fleming,James Clark

doi:10.1098/rspb.2016.2290

Mark N Puttick, Alastair R Tanner + Show 9 more

Open Access

https://doi.org/10.1098/rspb.2016.2290

Copy DOI

Abstract

Morphological data provide the only means of classifying the majority of life's history, but the choice between competing phylogenetic methods for the analysis of morphology is unclear. Traditionally, parsimony methods have been favoured but recent studies have shown that these approaches are less accurate than the Bayesian implementation of the Mk model. Here we expand on these findings in several ways: we assess the impact of tree shape and maximum-likelihood estimation using the Mk model, as well as analysing data composed of both binary and multistate characters. We find that all methods struggle to correctly resolve deep clades within asymmetric trees, and when analysing small character matrices. The Bayesian Mk model is the most accurate method for estimating topology, but with lower resolution than other methods. Equal weights parsimony is more accurate than implied weights parsimony, and maximum-likelihood estimation using the Mk model is the least accurate method. We conclude that the Bayesian implementation of the Mk model should be the default method for phylogenetic estimation from phenotype datasets, and we explore the implications of our simulations in reanalysing several empirical morphological character matrices. A consequence of our finding is that high levels of resolution or the ability to classify species or groups with much confidence should not be expected when using small datasets. It is now necessary to depart from the traditional parsimony paradigms of constructing character matrices, towards datasets constructed explicitly for Bayesian methods.

Highlights

The fossil record affords the only direct insight into evolutionary history of life on the Earth, but the incomplete preservation and temporal distribution of fossils has long prompted biologists to seek alternative perspectives, such as molecular phylogenies of living species, eschewing palaeontological evidence altogether [1]
These studies were potentially biased by their experimental design: (i) two of the studies employed a generating tree that was unresolved and, biased against parsimony methods which recover resolved trees; (ii) these studies did not discriminate between the impact of the probabilistic model and its implementation in a Bayesian framework; (iii) based on single empirical trees, the impact of tree symmetry, which is known to confound phylogeny estimation [10], was not explored; and (iv) only binary characters were considered, whereas empirical datasets are commonly a mixture of binary and multistate characters
This view is corroborated by our reanalysis of empirical datasets which recovered poorly resolved trees using the Bayesian implementation of the Mk model, and in a number of instances, indicate that the conclusions drawn in the corresponding original studies are not supported by the data

Summary

Introduction

The fossil record affords the only direct insight into evolutionary history of life on the Earth, but the incomplete preservation and temporal distribution of fossils has long prompted biologists to seek alternative perspectives, such as molecular phylogenies of living species, eschewing palaeontological evidence altogether [1]. A number of studies have attempted to establish the efficacy of competing phylogenetic methods using data simulated from known trees [7,8,9], finding that the probabilistic Mkv model outperforms parsimony methods, among which, conventional equal-weights parsimony (EW-Parsimony) performs best These studies were potentially biased by their experimental design: (i) two of the studies employed a generating tree that was unresolved and, biased against parsimony methods which recover resolved trees; (ii) these studies did not discriminate between the impact of the probabilistic model and its implementation in a Bayesian framework; (iii) based on single empirical trees, the impact of tree symmetry, which is known to confound phylogeny estimation [10], was not explored; and (iv) only binary characters were considered, whereas empirical datasets are commonly a mixture of binary and multistate characters. The Mkv extension of the Mk model, which uses conditional likelihood to correct for such acquisition biases, is more appropriate than the Mk model for analysis of these empirical data matrices [6]

Results

Discussion

Findings

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the Royal Society B: Biological Sciences	Publication Date: Jan 11, 2017
Citations: 112	License type: cc-by

R Discovery Prime

R Discovery Prime

Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proceedings of the Royal Society B: Biological Sciences

Lead the way for us

Similar Papers

Confirmatory Factor Analyses in Psychological Test Adaptation and Development
Kay Brauer ... Matthias Ziegler
Psychological Test Adaptation and Development | VOL. 4
Kay Brauer, et. al.Kay Brauer ... Matthias Ziegler
01 Feb 2023
Psychological Test Adaptation and Development | VOL. 4

Towards a phylogeny of gastropod molluscs: an analysis using morphological characters
Winston F Ponder
Zoological Journal of the Linnean Society | VOL. 119
Winston F PonderWinston F Ponder
01 Feb 1997
Zoological Journal of the Linnean Society | VOL. 119

Towards a phylogeny of gastropod molluscs: an analysis using morphological characters
Winston F Ponder ... David R Lindberg
Zoological Journal of the Linnean Society | VOL. 119
Winston F Ponder, et. al.Winston F Ponder ... David R Lindberg
01 Feb 1997
Zoological Journal of the Linnean Society | VOL. 119

Likelihood approximations of implied weights parsimony can be selected over the Mk model by the Akaike information criterion.
Pablo A Goloboff ... J Salvador Arias
Cladistics | VOL. 35
Pablo A Goloboff, et. al.Pablo A Goloboff ... J Salvador Arias
25 Mar 2019
Cladistics | VOL. 35

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proceedings of the Royal Society B: Biological Sciences