Bayesian Models and Markov Chain Monte Carlo Methods for Protein Motifs with the Secondary Characteristics

Jun Xie,Nak-Kyeong Kim

doi:10.1089/cmb.2005.12.952

Abstract

Statistical methods have been developed for finding local patterns, also called motifs, in multiple protein sequences. The aligned segments may imply functional or structural core regions. However, the existing methods often have difficulties in aligning multiple proteins when sequence residue identities are low (e.g., less than 25%). In this article, we develop a Bayesian model and Markov chain Monte Carlo (MCMC) methods for identifying subtle motifs in protein sequences. Specifically, a motif is defined not only in terms of specific sites characterized by amino acid frequency vectors, but also as a combination of secondary characteristics such as hydrophobicity, polarity, etc. Markov chain Monte Carlo methods are proposed to search for a motif pattern with high posterior probability under the new model. A special MCMC algorithm is developed, involving transitions between state spaces of different dimensions. The proposed methods were supported by a simulated study. It was then tested by two real datasets, including a group of helix-turn-helix proteins, and one set from the CATH Protein Structure Classification Database. Statistical comparisons showed that the new approach worked better than a typical Gibbs sampling approach which is based only on an amino acid model.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Bayesian Models and Markov Chain Monte Carlo Methods for Protein Motifs with the Secondary Characteristics

Abstract

Talk to us

Similar Papers

More From: Journal of computational biology : a journal of computational molecular cell biology

Lead the way for us

Journal: Journal of computational biology : a journal of computational molecular cell biology	Publication Date: Sep 1, 2005
Citations: 18

Similar Papers

A comparison of Bayesian Markov chain Monte Carlo methods in a multilevel scenario
Darshika Karunarasan ... Vimukthini Pinto
Communications in Statistics - Simulation and Computation | VOL. ahead-of-print
Darshika Karunarasan, et. al.Darshika Karunarasan ... Vimukthini Pinto
15 Aug 2021
Communications in Statistics - Simulation and Computation | VOL. ahead-of-print

Parallel Computation of Reverse PageRank Problem with Evaluating Single Page
Siyan Lai ... Xiaola Lin
-
Siyan Lai, et. al.Siyan Lai ... Xiaola Lin
01 Oct 2016
01 Oct 2016

Sensitivity estimations for Bayesian inference models solved by MCMC methods
C.J Pérez ... M.J Rufo
Reliability Engineering & System Safety | VOL. 91
C.J Pérez, et. al.C.J Pérez ... M.J Rufo
06 Jan 2006
Reliability Engineering & System Safety | VOL. 91

Bayesian Variable Selection in Linear Regression in One Pass for Large Data Sets.
Carlos Ordonez ... Veerabhadaran Baladandayuthapani
ACM transactions on knowledge discovery from data | VOL. 9
Carlos Ordonez, et. al.Carlos Ordonez ... Veerabhadaran Baladandayuthapani
25 Aug 2014
ACM transactions on knowledge discovery from data | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bayesian Models and Markov Chain Monte Carlo Methods for Protein Motifs with the Secondary Characteristics

Abstract

Talk to us

Similar Papers

More From: Journal of computational biology : a journal of computational molecular cell biology