Abstract

BackgroundMethods are now available for the prediction of interaction sites in protein 3D structures. While many of these methods report high success rates for site prediction, often these predictions are not very selective and have low precision. Precision in site prediction is addressed using Theoretical Microscopic Titration Curves (THEMATICS), a simple computational method for the identification of active sites in enzymes. Recall and precision are measured and compared with other methods for the prediction of catalytic sites.ResultsUsing a test set of 169 enzymes from the original Catalytic Residue Dataset (CatRes) it is shown that THEMATICS can deliver precise, localised site predictions. Furthermore, adjustment of the cut-off criteria can improve the recall rates for catalytic residues with only a small sacrifice in precision. Recall rates for CatRes/CSA annotated catalytic residues are 41.1%, 50.4%, and 54.2% for Z score cut-off values of 1.00, 0.99, and 0.98, respectively. The corresponding precision rates are 19.4%, 17.9%, and 16.4%. The success rate for catalytic sites is higher, with correct or partially correct predictions for 77.5%, 85.8%, and 88.2% of the enzymes in the test set, corresponding to the same respective Z score cut-offs, if only the CatRes annotations are used as the reference set. Incorporation of additional literature annotations into the reference set gives total success rates of 89.9%, 92.9%, and 94.1%, again for corresponding cut-off values of 1.00, 0.99, and 0.98. False positive rates for a 75-protein test set are 1.95%, 2.60%, and 3.12% for Z score cut-offs of 1.00, 0.99, and 0.98, respectively.ConclusionWith a preferred cut-off value of 0.99, THEMATICS achieves a high success rate of interaction site prediction, about 86% correct or partially correct using CatRes/CSA annotations only and about 93% with an expanded reference set. Success rates for catalytic residue prediction are similar to those of other structure-based methods, but with substantially better precision and lower false positive rates. THEMATICS performs well across the spectrum of E.C. classes. The method requires only the structure of the query protein as input. THEMATICS predictions may be obtained via the web from structures in PDB format at:

Highlights

  • Methods are available for the prediction of interaction sites in protein 3D structures

  • The identification of the interaction sites in protein structures is a critical step in the determination of function from the wealth of sequence and structure information emerging from genome sequencing and from structural genomics efforts [8,9,10,11,12,13,14]

  • We have shown that the shapes of the theoretical titration curves generated from a finite difference Poisson-Boltzmann (FDPB) method, they are only approximate, contain a great deal of useful information about the location, binding properties and chemical properties of the active site [1,6,7]

Read more

Summary

Introduction

Methods are available for the prediction of interaction sites in protein 3D structures While many of these methods report high success rates for site prediction, often these predictions are not very selective and have low precision. Precision in the prediction of sites is addressed using THEMATICS, a simple computational method for the identification of enzyme active sites from the three-dimensional structure alone [1,2,3,4,5,6,7]. One goal of the present paper is to show how the already good performance of THEMATICS can be improved and to quantify the recall and precision rates of the method through comparison of its predictions with the information from a database of enzymes with experimentally identified active sites. The identification of the interaction sites in protein structures is a critical step in the determination of function from the wealth of sequence and structure information emerging from genome sequencing and from structural genomics efforts [8,9,10,11,12,13,14]

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.