Abstract

A systematic classification of protein–protein interfaces is a valuable resource for understanding the principles of molecular recognition and for modelling protein complexes. Here, we present a classification of domain interfaces according to their geometry. Our new algorithm uses a hybrid approach of both sequential and structural features. The accuracy is evaluated on a hand-curated dataset of 416 interfaces. Our hybrid procedure achieves 83% precision and 95% recall, which improves the earlier sequence-based method by 5% on both terms. We classify virtually all domain interfaces of known structure, which results in nearly 6,000 distinct types of interfaces. In 40% of the cases, the interacting domain families associate in multiple orientations, suggesting that all the possible binding orientations need to be explored for modelling multidomain proteins and protein complexes. In general, hub proteins are shown to use distinct surface regions (multiple faces) for interactions with different partners. Our classification provides a convenient framework to query genuine gene fusion, which conserves binding orientation in both fused and separate forms. The result suggests that the binding orientations are not conserved in at least one-third of the gene fusion cases detected by a conventional sequence similarity search. We show that any evolutionary analysis on interfaces can be skewed by multiple binding orientations and multiple interaction partners. The taxonomic distribution of interface types suggests that ancient interfaces common to the three major kingdoms of life are enriched by symmetric homodimers. The classification results are online at http://www.scoppi.org.

Highlights

  • Protein tertiary and quaternary structures often provide a deep insight into a protein’s function and its underlying mechanism

  • We classified virtually all the domain interfaces found in known structures, resulting in nearly 6,000 distinct interface types

  • Our hybrid procedure achieves a similar accuracy of 83% recall and 95% precision, while saving the amount of computation 100-fold

Read more

Summary

Introduction

Protein tertiary and quaternary structures often provide a deep insight into a protein’s function and its underlying mechanism. The authors provide a comprehensive compendium and classification of these structural interfaces To this end, they design a fast and accurate algorithm, which they apply to all known structural interactions. They design a fast and accurate algorithm, which they apply to all known structural interactions As a result, they shed light on the geometry and the evolution of protein interfaces. Interface Classification by Face Clustering Nussinov and colleagues classified interfaces based on common structural features shared among the interfaces from various folds [20,21]. Three Different Features Measuring the Similarity between Two Faces (A) Two faces in I set domain family (green and magenta) interacting with fibroblast growth factor (gray) in different binding orientations. For all the families in PQS, the hybrid method took 32 CPU days on a

Method
Conclusion
Materials and Methods
Findings
23 Family Pairs Common to All Three Kingdoms and Having Ten or More
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call