The embedding problem for markov models of nucleotide substitution.

Klara L Verbyla,Von Bing Yap,Yunli Shao,Anuj Pahwa,Gavin A Huttley

doi:10.1371/journal.pone.0069187

Abstract

Continuous-time Markov processes are often used to model the complex natural phenomenon of sequence evolution. To make the process of sequence evolution tractable, simplifying assumptions are often made about the sequence properties and the underlying process. The validity of one such assumption, time-homogeneity, has never been explored. Violations of this assumption can be found by identifying non-embeddability. A process is non-embeddable if it can not be embedded in a continuous time-homogeneous Markov process. In this study, non-embeddability was demonstrated to exist when modelling sequence evolution with Markov models. Evidence of non-embeddability was found primarily at the third codon position, possibly resulting from changes in mutation rate over time. Outgroup edges and those with a deeper time depth were found to have an increased probability of the underlying process being non-embeddable. Overall, low levels of non-embeddability were detected when examining individual edges of triads across a diverse set of alignments. Subsequent phylogenetic reconstruction analyses demonstrated that non-embeddability could impact on the correct prediction of phylogenies, but at extremely low levels. Despite the existence of non-embeddability, there is minimal evidence of violations of the local time homogeneity assumption and consequently the impact is likely to be minor.

Highlights

DNA sequences are widely used to infer evolutionary relationships among species, genes, and genomes
Like other complex natural phenomenon, simplifying assumptions are made for efficient computation
All probabilistic models of sequence evolution generally adopt a set of simplifying assumptions relating to the sequence properties and the evolutionary process to make the models computationally tractable and statistically efficient

Summary

Introduction

DNA sequences are widely used to infer evolutionary relationships among species, genes, and genomes. Like other complex natural phenomenon, simplifying assumptions are made for efficient computation. For sequence evolution maximum likelihood estimation for a probabilistic model is most common. This is because maximum likelihood estimation is statistically consistent (provided the underlying model is identifiable). All probabilistic models of sequence evolution generally adopt a set of simplifying assumptions relating to the sequence properties and the evolutionary process to make the models computationally tractable and statistically efficient. Stationarity assumes the process is in equilibrium resulting in equivalent ancestral and stationary base frequencies. A globally homogeneous process assumes that all branches share the same rate matrix. To relax the assumption of global time-homogeneity, some approaches allow separate substitution rate matrices for each branch of the tree (local time homogeneity)

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLoS ONE	Publication Date: Jul 30, 2013
Citations: 58	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

The embedding problem for markov models of nucleotide substitution.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE

Lead the way for us

Similar Papers

Differential Influence of Nucleoside Analog-resistance Mutations K65R and L74V on the Overall Mutation Rate and Error Specificity of Human Immunodeficiency Virus Type 1 Reverse Transcriptase
Falguni S Shah ... Vinayaka R Prasad
Journal of Biological Chemistry | VOL. 275
Falguni S Shah, et. al.Falguni S Shah ... Vinayaka R Prasad
01 Sep 2000
Journal of Biological Chemistry | VOL. 275

Fidelity drive: A mechanism for chaperone proteins to maintain stable mutation rates in prokaryotes over evolutionary time
Julian Z Xue ... Frederic Guichard
Journal of Theoretical Biology | VOL. 364
Julian Z Xue, et. al.Julian Z Xue ... Frederic Guichard
21 Sep 2014
Journal of Theoretical Biology | VOL. 364

Chapter 6 - Continuous Time Markov Chains
Howard M. Taylor ... Samuel Karlin
An Introduction to Stochastic Modeling | VOL. -
Howard M. Taylor, et. al.Howard M. Taylor ... Samuel Karlin
01 Jan 1984
An Introduction to Stochastic Modeling | VOL. -

Coordinated Changes in Mutation and Growth Rates Induced by Genome Reduction.
Issei Nishimura ... Liu Liu
mBio | VOL. 8
Issei Nishimura, et. al.Issei Nishimura ... Liu Liu
05 Jul 2017
mBio | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The embedding problem for markov models of nucleotide substitution.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE