A Dirichlet process model for detecting positive selection in protein-coding DNA sequences

John P. Huelsenbeck, Sergei L. Kosakovsky Pond, Sonia Jain, Simon W. D. Frost

doi:10.1073/pnas.0508279103

Abstract

Most methods for detecting Darwinian natural selection at the molecular level rely on estimating the rates or numbers of nonsynonymous and synonymous changes in an alignment of protein-coding DNA sequences. In some of these methods, the nonsynonymous rate of substitution is allowed to vary across the sequence, permitting the identification of single amino acid positions that are under positive natural selection. However, it is unclear which probability distribution should be used to describe how the nonsynonymous rate of substitution varies across the sequence. One widely used solution is to model variation in the nonsynonymous rate across the sequence as a mixture of several discrete or continuous probability distributions. Unfortunately, there is little population genetics theory to inform us of the appropriate probability distribution for among-site variation in the nonsynonymous rate of substitution. Here, we describe an approach to modeling variation in the nonsynonymous rate of substitution by using a Dirichlet process mixture model. The Dirichlet process allows there to be a countably infinite number of nonsynonymous rate classes and is very flexible in accommodating different potential distributions for the nonsynonymous rate of substitution. We implemented the model in a fully Bayesian approach, with all parameters of the model considered as random variables.
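As a rough illustration of how a Dirichlet process prior lets the number of nonsynonymous rate classes remain open-ended, the sketch below draws a partition of codon sites using the Chinese restaurant process, a standard representation of the Dirichlet process, and then gives each class its own rate. It is not the authors' implementation. The concentration parameter `alpha`, the exponential base distribution, and all function names are assumptions chosen for illustration; the abstract does not specify them.

```python
import random


def sample_crp_partition(n_sites, alpha, rng):
    """Assign sites to rate classes with the Chinese restaurant process.

    Site i (0-based) joins existing class k with probability
    size_k / (i + alpha) and opens a new class with probability
    alpha / (i + alpha).
    """
    assignments = []
    class_sizes = []
    for _ in range(n_sites):
        # The last weight corresponds to opening a new class.
        weights = class_sizes + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(class_sizes):
            class_sizes.append(1)
        else:
            class_sizes[k] += 1
        assignments.append(k)
    return assignments, class_sizes


def sample_site_rates(n_sites, alpha, rng, base_mean=1.0):
    """Draw one nonsynonymous rate per class and map it onto the sites.

    The base distribution is exponential with mean base_mean; this is an
    illustrative assumption, not a choice taken from the paper.
    """
    assignments, class_sizes = sample_crp_partition(n_sites, alpha, rng)
    class_rates = [rng.expovariate(1.0 / base_mean) for _ in class_sizes]
    site_rates = [class_rates[k] for k in assignments]
    return site_rates, class_rates, assignments


if __name__ == "__main__":
    rng = random.Random(1)
    site_rates, class_rates, assignments = sample_site_rates(100, 1.0, rng)
    print("number of rate classes:", len(class_rates))
    print("sites in classes with rate > 1:",
          sum(1 for r in site_rates if r > 1.0))
```

Larger values of `alpha` tend to produce more classes, while the number of classes is never fixed in advance. In a full Bayesian analysis the partition and the class rates would be updated jointly with the other model parameters rather than drawn once from the prior as here.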

Journal: Proceedings of the National Academy of Sciences of the United States of America
Publication Date: Apr 18, 2006
Citations: 82

Similar Papers

A Conditional Autoregressive Model for Detecting Natural Selection in Protein-Coding DNA Sequences
Yu Fan ... Rui Wu
01 Jan 2013

A Bayesian Small Area Model with Dirichlet Processes on the Responses
Jiani Yin ... Balgobin Nandram
Statistics in Transition new series | VOL. 21
01 Sep 2020

Bayesian estimation of positively selected sites.
John P Huelsenbeck ... Kelly A Dyer
Journal Of Molecular Evolution | VOL. 58
01 Jun 2004

Estimating absolute rates of synonymous and nonsynonymous nucleotide substitution in order to characterize natural selection and date species divergences.
T.-K. Seo
Molecular Biology And Evolution | VOL. 21
19 Mar 2004
