Rapid catalytic template searching as an enzyme function prediction procedure.

Jerome P Nilmeier,Daniel A Kirshner,Felice C Lightstone,Sergio E Wong

doi:10.1371/journal.pone.0062535

Jerome P Nilmeier, Daniel A Kirshner + Show 2 more

Open Access

https://doi.org/10.1371/journal.pone.0062535

Copy DOI

Journal: PloS one	Publication Date: May 10, 2013
Citations: 94	License type: CC BY 4.0

Affiliation: Lawrence Livermore National Laboratory

Abstract

We present an enzyme protein function identification algorithm, Catalytic Site Identification (CatSId), based on identification of catalytic residues. The method is optimized for highly accurate template identification across a diverse template library and is also very efficient in regards to time and scalability of comparisons. The algorithm matches three-dimensional residue arrangements in a query protein to a library of manually annotated, catalytic residues – The Catalytic Site Atlas (CSA). Two main processes are involved. The first process is a rapid protein-to-template matching algorithm that scales quadratically with target protein size and linearly with template size. The second process incorporates a number of physical descriptors, including binding site predictions, in a logistic scoring procedure to re-score matches found in Process 1. This approach shows very good performance overall, with a Receiver-Operator-Characteristic Area Under Curve (AUC) of 0.971 for the training set evaluated. The procedure is able to process cofactors, ions, nonstandard residues, and point substitutions for residues and ions in a robust and integrated fashion. Sites with only two critical (catalytic) residues are challenging cases, resulting in AUCs of 0.9411 and 0.5413 for the training and test sets, respectively. The remaining sites show excellent performance with AUCs greater than 0.90 for both the training and test data on templates of size greater than two critical (catalytic) residues. The procedure has considerable promise for larger scale searches.

Highlights

Given the success of the structural genomics efforts (1125 PDB entries) and many genome sequencing efforts, automated protein function annotation is critical [1]
We developed an automated protein function identification method based on the hypothesis that catalytic residues and their geometric arrangement are key determinants for enzymatic chemical activity
We have developed an automated procedure for protein function prediction based on the identification of catalytic site residues, called the Catalytic Site Identification (CatSId)

Summary

Introduction

Given the success of the structural genomics efforts (1125 PDB entries) and many genome sequencing efforts, automated protein function annotation is critical [1]. At the core of many automated methods is the principle that sequence and structure dictate function. One approach is to infer function by focusing on global sequence or structural similarity. Methods that combine sequence and structural information include EFICAz [21,22], SOIPPA [23,24,25], DISCERN [26], PevoSOAR [27], and AnnoLite [28] and can provide improvements to sequence based methods alone. The success of global similarity-based techniques depends largely on the ability to distinguish conservation patterns that correspond to functional or catalytic portions of a protein sequence or structure. The approach we present in this work is designed to leverage the knowledge of specific catalytic site residues rather than to infer the functional features from global comparisons

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Rapid catalytic template searching as an enzyme function prediction procedure.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PloS one

Lead the way for us

Similar Papers

Networks of High Mutual Information Define the Structural Proximity of Catalytic Sites: Implications for Catalytic Residue Identification
Cristina Marino Buslje ... José María Delfino
PLoS Computational Biology | VOL. 6
Cristina Marino Buslje, et. al.Cristina Marino Buslje ... José María Delfino
04 Nov 2010
PLoS Computational Biology | VOL. 6

The Catalytic Site Atlas 2.0: cataloging catalytic sites and residues identified in enzymes
Nicholas Furnham ... William R Pearson
Nucleic Acids Research | VOL. 42
Nicholas Furnham, et. al.Nicholas Furnham ... William R Pearson
06 Dec 2013
Nucleic Acids Research | VOL. 42

The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data.
C T Porter
Nucleic Acids Research | VOL. 32
C T PorterC T Porter
01 Jan 2004
Nucleic Acids Research | VOL. 32

Active site prediction using evolutionary and structural information
Sriram Sankararaman ... Jack F Kirsch
Bioinformatics | VOL. 26
Sriram Sankararaman, et. al.Sriram Sankararaman ... Jack F Kirsch
14 Jan 2010
Bioinformatics | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Rapid catalytic template searching as an enzyme function prediction procedure.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PloS one