Abstract

Studying similarities in protein molecules has become a fundamental activity in much of biology and biomedical research, for which methods such as multiple sequence alignments are widely used. Most methods available for such comparisons cater to studying proteins which have clearly recognizable evolutionary relationships but not to proteins that recognize the same or similar ligands but do not share similarities in their sequence or structural folds. In many cases, proteins in the latter class share structural similarities only in their binding sites. While several algorithms are available for comparing binding sites, there are none for deriving structural motifs of the binding sites, independent of the whole proteins. We report the development of SiteMotif, a new algorithm that compares binding sites from multiple proteins and derives sequence-order independent structural site motifs. We have tested the algorithm at multiple levels of complexity and demonstrate its performance in different scenarios. We have benchmarked against 3 current methods available for binding site comparison and demonstrate superior performance of our algorithm. We show that SiteMotif identifies new structural motifs of spatially conserved residues in proteins, even when there is no sequence or fold-level similarity. We expect SiteMotif to be useful for deriving key mechanistic insights into the mode of ligand interaction, predict the ligand type that a protein can bind and improve the sensitivity of functional annotation.

Highlights

  • It is known that not all residues are involved in ligand recognition

  • Recognising similarities and deriving relationships among protein molecules is an important task in present-day biology, often forming the basis for protein function annotation

  • We describe rigorous validation of SiteMotif at different levels of complexity and benchmark its performance against 3 current methods available for binding site comparison

Read more

Summary

Introduction

Recognising similarities and deriving relationships among protein molecules is an important task in present-day biology, often forming the basis for protein function annotation. There are several proteins that have no detectable sequence level similarities but show similarities in structure in the whole protein or just in the binding site and have the same function [5]. Comparison in such cases is more addressed by studying the three-dimensional structures of the ligand binding sites. Knowledge of the site motifs will (a) be useful in understanding the basis of ligand recognition, (b) enable comparison of proteins that share similarities only in their binding sites irrespective of sequence or fold level similarities and (c) facilitate prioritization of residues at the site in drug discovery applications

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call