Abstract

Abstract- Evaluating the reliability of methods for reconstructing evolutionary trees is discussed under the four headings of: evaluating criteria for an optimal tree, finding the optimal tree for the criterion selected, detecting reliable and unreliable data, and estimating the error range for the final tree. It is shown with five data sets (protein sequences) that, in general, the minimal tree is a better estimate of phylogeny than a longer tree. However, for each data set, the minimal tree was no longer the shortest when the sequences were combined. An objective weighting of columns (characters) can lead to an improved tree by giving less weight to columns that are closer to a random order. The weighting of characters is derived from the ratio of the observed to expected number of incompatabilities for each column. Several forms of weighting give better trees as measured by both the increase in correlation between lengths of trees with different subsets of data, and by an increase in the similarity between minimal trees found with disjoint subsets of data. Increasing the size of randomly selected subsets, and measuring the increased similarity of the results, can lead to an estimate of the minimum number of trees that need to be considered as possibly the correct historical tree. A measure of the 'treeness' of the data is described that estimates the extent to which a binary tree is a good description of the data.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.