Abstract

The problem addressed is that of estimating evolutionary relationship by the comparative study of the nucleic acid or protein sequences of living organisms. The most important point made in this account is that estimation of evolutionary relationship should be based on clearly defined models the assumptions of which are open to test. The models should as far as possible conform to what is known about the processes of evolutionary change in the organisms concerned. Prevailing approaches, grouped here as divergence models, are stated below in such a way that it is clear that they involve unrealistic assumptions about the nature of evolutionary change. Emphasis is placed on the use of probabilistic models of evolutionary change. The historical development of these models has proceeded in parallel with the more commonly used ‘parsimony’ methods. The problem of reconstructing phylogenies is simplified by assuming that the pathways of genetic transmission conform to a tree structure. The tree model is justified on the grounds that such pathways may be traced in a genealogy, however, the tree model ignores hybridization and horizontal transmission of the genetic material. The other essential component is a probabilistic formulation of the processes of genetic change. Consideration of genetic reliability (a view of mutation as failure correctly to copy information) leads to such a probabilistic description. Several proposed schemes which make numerical assessment of the relative frequencies of base substitution in DNA are considered. We next examine methods for the estimation of phylogenetic trees on the basis of probabilistic models. Pairwise estimates of divergence times lead rapidly to hypotheses of evolutionary relationship, but it is stressed that joint estimation procedures, which simultaneously take account of all the data, lead to more complete estimates of relationship. The various methods are illustrated as applied to the analysis of nucleic acid sequence data from the mammalian mitochondrial genome. Finally, we discuss weaknesses of the current stochastic models and point out ways in which accumulating experimental information may lead to their refinement or refutation.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call