Identifiability of Large Phylogenetic Mixture Models

John A Rhodes,Seth Sullivant

doi:10.1007/s11538-011-9672-2

Abstract

Phylogenetic mixture models are statistical models of character evolution allowing for heterogeneity. Each of the classes in some unknown partition of the characters may evolve by different processes, or even along different trees. Such models are of increasing interest for data analysis, as they can capture the variety of evolutionary processes that may be occurring across long sequences of DNA or proteins. The fundamental question of whether parameters of such a model are identifiable is difficult to address, due to the complexity of the parameterization. Identifiability is, however, essential to their use for statistical inference.We analyze mixture models on large trees, with many mixture components, showing that both numerical and tree parameters are indeed identifiable in these models when all trees are the same. This provides a theoretical justification for some current empirical studies, and indicates that extensions to even more mixture components should be theoretically well behaved. We also extend our results to certain mixtures on different trees, using the same algebraic techniques.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Identifiability of Large Phylogenetic Mixture Models

Abstract

Talk to us

Similar Papers

More From: Bulletin of Mathematical Biology

Lead the way for us

Journal: Bulletin of Mathematical Biology	Publication Date: Jun 30, 2011
Citations: 45

Similar Papers

When Do Phylogenetic Mixture Models Mimic Other Phylogenetic Models?
Elizabeth S Allman ... Seth Sullivant
Systematic Biology | VOL. 61
Elizabeth S Allman, et. al.Elizabeth S Allman ... Seth Sullivant
10 Sep 2012
Systematic Biology | VOL. 61

Phylogenetic mixture models for proteins
Si Quang Le ... Olivier Gascuel
Philosophical Transactions of the Royal Society B: Biological Sciences | VOL. 363
Si Quang Le, et. al.Si Quang Le ... Olivier Gascuel
07 Oct 2008
Philosophical Transactions of the Royal Society B: Biological Sciences | VOL. 363

On the artefactual parasitic eubacteria clan in conditioned logdet phylogenies: heterotachy and ortholog identification artefacts as explanations
Ajanthah Sangaralingam ... Edward Susko
BMC Evolutionary Biology | VOL. 10
Ajanthah Sangaralingam, et. al.Ajanthah Sangaralingam ... Edward Susko
09 Nov 2010
BMC Evolutionary Biology | VOL. 10

Determining the number of components in mixtures of linear models
Dollena S Hawkins ... Arnold J Stromberg
Computational Statistics & Data Analysis | VOL. 38
Dollena S Hawkins, et. al.Dollena S Hawkins ... Arnold J Stromberg
30 Oct 2001
Computational Statistics & Data Analysis | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Identifiability of Large Phylogenetic Mixture Models

Abstract

Talk to us

Similar Papers

More From: Bulletin of Mathematical Biology