Abstract
Morphological data play a key role in the inference of biological relationships and evolutionary history and are essential for the interpretation of the fossil record. The hierarchical interdependence of many morphological characters, however, complicates phylogenetic analysis. In particular, many characters only apply to a subset of terminal taxa. The widely used “reductive coding” approach treats taxa in which a character is inapplicable as though the character’s state is simply missing (unknown). This approach has long been known to create spurious tree length estimates on certain topologies, potentially leading to erroneous results in phylogenetic searches—but pratical solutions have yet to be proposed and implemented. Here, we present a single-character algorithm for reconstructing ancestral states in reductively coded data sets, following the theoretical guideline of minimizing homoplasy over all characters. Our algorithm uses up to three traversals to score a tree, and a fourth to fully resolve final states at each node within the tree. We use explicit criteria to resolve ambiguity in applicable/inapplicable dichotomies, and to optimize missing data. So that it can be applied to single characters, the algorithm employs local optimization; as such, the method provides a fast but approximate inference of ancestral states and tree score. The application of our method to published morphological data sets indicates that, compared to traditional methods, it identifies different trees as “optimal.” As such, the use of our algorithm to handle inapplicable data may significantly alter the outcome of tree searches, modifying the inferred placement of living and fossil taxa and potentially leading to major differences in reconstructions of evolutionary history.
Highlights
Morphological characters are an essential source of data in phylogenetic studies
We have presented a single-character modified Fitch algorithm for ancestral state reconstructions that is aware of a special “inapplicable” token
This algorithm avoids logically impossible reconstructions of ancestral states by acknowledging that applicable state distributions rely on the prior resolution of dichotomies between applicable and inapplicable characters
Summary
Morphological characters are an essential source of data in phylogenetic studies. Even in the age of molecular sequence data, they underpin a range of research programmes that depend on knowledge of extinct or ancestral phenotypic conditions (e.g. palaeontology, molecular clock calibrations, comparative developmental biology). The essence of the problem is that existing parsimony methods measure the amount of homoplasy—the metric by which trees should be evaluated (De Laet, 2005)—by calculating the total number of character transformations This only works if character states refer exclusively to properties of homologous structures (Platnick, 1979). If transformations between applicable and inapplicable states contribute nothing to tree length, identical independent appearances (e.g. gains) of a character have no added cost, even when they are identical (i.e. share putative homology) This can, in some cases, result in a penalty for character congruence (Fig. 1), and a penalty for homology: a situation we consider inconsistent with the theory of phylogenetic parsimony. It is during this pass that the tracker values are updated, and the score of the tree is calculated
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.