Abstract

The remarkable diversity in biological systems is rooted in the ability of the twenty naturally occurring amino acids to perform multifarious catalytic functions by creating unique structural scaffolds known as the active site. Finding such structrual motifs within the protein structure is a key aspect of many computational methods. The algorithm for obtaining combinations of motifs of a certain length, although polynomial in complexity, runs in non-trivial computer time. Also, the search space expands considerably if stereochemically equivalent residues are allowed to replace an amino acid in the motif. In the present work, we propose a method to precompile all possible motifs comprising of a set (n=4 in this case) of predefined amino acid residues from a protein structure that occur within a specified distance (R) of each other (PREMONITION). PREMONITION rolls a sphere of radius R along the protein fold centered at the C atom of each residue, and all possible motifs are extracted within this sphere. The number of residues that can occur within a sphere centered around a residue is bounded by physical constraints, thus setting an upper limit on the processing times. After such a pre-compilation step, the computational time required for querying a protein structure with multiple motifs is considerably reduced. Previously, we had proposed a computational method to estimate the promiscuity of proteins with known active site residues and 3D structure using a database of known active sites in proteins (CSA) by querying each protein with the active site motif of every other residue. The runtimes for such a comparison is reduced from days to hours using the PREMONITION methodology.

Highlights

  • The rapid development of crystallization techniques has resulted in a deluge of proteins with known structures[1]

  • Sequence alignment methods are not applicable in cases where similar functional groups are identically positioned in the active site of proteins with no sequence homology

  • We propose a method to precompile all possible motifs comprising of a set (n=4 in this case) of predefined amino acid residues from a protein structure that occur within a specified distance (R) of each other (PREMONITION - Preprocessing motifs in protein structures for search acceleration)

Read more

Summary

Introduction

The rapid development of crystallization techniques has resulted in a deluge of proteins with known structures[1]. Most of the proteins are annotated using sequence alignment methods by a ‘guilt by association’ logic based on the sequence-to-structure-to-function paradigm[2]. Sequence alignment methods are not applicable in cases where similar functional groups are identically positioned in the active site of proteins with no sequence homology. The classic example of this phenomenon, known as convergent evolution[3,4], is the major families of serine proteases (chymotrypsin and subtilisin), where the active site is structurally and functionally identical, though there is no global sequence or structural homology[5]. We have demonstrated through several detailed examples, using a method (CLASP19) which falls in the first category, that such structural conservation necessitates the conservation of electrostatic properties in proteins with the same functionality[19,20,21,22]

Objectives
Methods
Results

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.