Bayesian data mining of protein domains gives an efficient predictive algorithm and new insight

Rajani R Joshi, Vivekanand V Samant

doi:10.1007/s00894-006-0141-z

Abstract

Identification of structural domains in uncharacterized protein sequences is important for predicting protein tertiary folds and functional sites, and hence for designing biologically active molecules. We present a new predictive computational method, based on Bayesian Data Mining, that classifies a protein as having a single domain, two continuous domains or two discontinuous domains. The algorithm requires only the primary sequence and computer-predicted secondary structure. It incorporates correlation patterns between certain 3-dimensional motifs and some local helical folds that are found, with high statistical confidence, to be conserved in the vicinity of protein domains. This computationally simple and fast method predicts the domain class with good accuracy: average accuracies are 83.3% for single domain, 60% for two continuous and 65.7% for two discontinuous domain proteins. Experiments on a large validation sample show its performance to be significantly better than that of DGS and DomSSEA. The computed Bayesian probabilities reveal important features, in the form of correlations between certain conserved patterns of secondary folds and tertiary motifs, and give new insight. Applications to improving the accuracy of domain boundary point prediction, relevant to protein structural and functional modeling, are also highlighted.
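To make the idea of Bayesian classification into three domain classes concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a naive-Bayes simplification (conditionally independent binary features), and the feature names (`helix_near_motif`, `long_loop`), the priors and the likelihoods are all hypothetical values chosen for illustration; the abstract does not specify any of them.

```python
from math import exp, log

# The three domain classes named in the abstract.
CLASSES = ("single", "two_continuous", "two_discontinuous")


def posterior(priors, likelihoods, observed):
    """Return P(class | observed) under a naive-Bayes assumption.

    priors:      {class: P(class)}
    likelihoods: {class: {feature: P(feature present | class)}}
    observed:    {feature: True/False}
    """
    log_scores = {}
    for c in CLASSES:
        score = log(priors[c])
        for feature, present in observed.items():
            p = likelihoods[c][feature]
            # Use P(feature | class) if present, else its complement.
            score += log(p if present else 1.0 - p)
        log_scores[c] = score

    # Normalize in log space for numerical stability.
    top = max(log_scores.values())
    weights = {c: exp(s - top) for c, s in log_scores.items()}
    total = sum(weights.values())
    return {c: w / total for c, w in weights.items()}


# Hypothetical numbers, for illustration only (not taken from the paper).
priors = {"single": 0.5, "two_continuous": 0.3, "two_discontinuous": 0.2}
likelihoods = {
    "single": {"helix_near_motif": 0.2, "long_loop": 0.3},
    "two_continuous": {"helix_near_motif": 0.6, "long_loop": 0.5},
    "two_discontinuous": {"helix_near_motif": 0.7, "long_loop": 0.8},
}
observed = {"helix_near_motif": True, "long_loop": False}

post = posterior(priors, likelihoods, observed)
predicted = max(post, key=post.get)  # class with the highest posterior
print(predicted, post)
```

In a real setting the likelihoods would be estimated from a training set of proteins with known domain assignments, and the features would be derived from the primary sequence and predicted secondary structure, as the abstract describes.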


Journal: Journal of Molecular Modeling
Publication Date: Oct 7, 2006
Citations: 5

