When is it safe to use an oversimplified substitution model in tree-making?

A Rzhetsky,T Sitnikova

doi:10.1093/oxfordjournals.molbev.a025691

Abstract

The choice of an "optimal" mathematical model for computing evolutionary distances from real sequences is not currently supported by easy-to-use software applicable to large data sets, and an investigator frequently selects one of the simplest models available. Here we study properties of the observed proportion of differences (p-distance) between sequences as an estimator of evolutionary distance for tree-making. We show that p-distances allow for consistent tree-making with any of the popular methods working with evolutionary distances if evolution of sequences obeys a "molecular clock" (more precisely, if it follows a stationary time-reversible Markov model of nucleotide substitution). Next, we show that p-distances seem to be efficient in recovering the correct tree topology under a "molecular clock," but produce "statistically supported" wrong trees when substitutions rates vary among evolutionary lineages. Finally, we outline a practical approach for selecting an "optimal" model of nucleotide substitution in a real data analysis, and obtain a crude estimate of a "prior" distribution of the expected tree branch lengths under the Jukes-Cantor model. We conclude that the use of a model that is obviously oversimplified is inadvisable unless it is justified by a preliminary analysis of the real sequences.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

When is it safe to use an oversimplified substitution model in tree-making?

Abstract

Talk to us

Similar Papers

More From: Molecular biology and evolution

Lead the way for us

Journal: Molecular biology and evolution	Publication Date: Nov 1, 1996
Citations: 62

Similar Papers

Estimation of evolutionary distances under stationary and nonstationary models of nucleotide substitution.
X Gu ... W H Li
Proceedings of the National Academy of Sciences of the United States of America | VOL. 95
X Gu, et. al.X Gu ... W H Li
26 May 1998
Proceedings of the National Academy of Sciences of the United States of America | VOL. 95

A general additive distance with time-reversibility and rate variation among nucleotide sites.
X Gu ... W H Li
Proceedings of the National Academy of Sciences of the United States of America | VOL. 93
X Gu, et. al.X Gu ... W H Li
14 May 1996
Proceedings of the National Academy of Sciences of the United States of America | VOL. 93

Selecting models of nucleotide substitution: an application to human immunodeficiency virus 1 (HIV-1).
David Posada ... Keith A Crandall
Molecular Biology and Evolution | VOL. 18
David Posada, et. al.David Posada ... Keith A Crandall
01 Jun 2001
Molecular Biology and Evolution | VOL. 18

A Theoretical Study of the Underestimation of Branch Lengths by the Maximum Parsimony Principle
N Saitou
Systematic Biology | VOL. 38
N SaitouN Saitou
01 Mar 1989
Systematic Biology | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

When is it safe to use an oversimplified substitution model in tree-making?

Abstract

Talk to us

Similar Papers

More From: Molecular biology and evolution