Abstract

BackgroundIt is well known that most of the binding free energy of protein interaction is contributed by a few key hot spot residues. These residues are crucial for understanding the function of proteins and studying their interactions. Experimental hot spots detection methods such as alanine scanning mutagenesis are not applicable on a large scale since they are time consuming and expensive. Therefore, reliable and efficient computational methods for identifying hot spots are greatly desired and urgently required.ResultsIn this work, we introduce an efficient approach that uses support vector machine (SVM) to predict hot spot residues in protein interfaces. We systematically investigate a wide variety of 62 features from a combination of protein sequence and structure information. Then, to remove redundant and irrelevant features and improve the prediction performance, feature selection is employed using the F-score method. Based on the selected features, nine individual-feature based predictors are developed to identify hot spots using SVMs. Furthermore, a new ensemble classifier, namely APIS (A combined model based on Protrusion Index and Solvent accessibility), is developed to further improve the prediction accuracy. The results on two benchmark datasets, ASEdb and BID, show that this proposed method yields significantly better prediction accuracy than those previously published in the literature. In addition, we also demonstrate the predictive power of our proposed method by modelling two protein complexes: the calmodulin/myosin light chain kinase complex and the heat shock locus gene products U and V complex, which indicate that our method can identify more hot spots in these two complexes compared with other state-of-the-art methods.ConclusionWe have developed an accurate prediction model for hot spot residues, given the structure of a protein complex. A major contribution of this study is to propose several new features based on the protrusion index of amino acid residues, which has been shown to significantly improve the prediction performance of hot spots. Moreover, we identify a compact and useful feature subset that has an important implication for identifying hot spot residues. Our results indicate that these features are more effective than the conventional evolutionary conservation, pairwise residue potentials and other traditional features considered previously, and that the combination of our and traditional features may support the creation of a discriminative feature set for efficient prediction of hot spot residues. The data and source code are available on web site http://home.ustc.edu.cn/~jfxia/hotspot.html.

Highlights

  • It is well known that most of the binding free energy of protein interaction is contributed by a few key hot spot residues

  • In light of these studies, we first designed and quantified a total of 62 multifaceted features from a combination of protein sequence and structure information. These features include: ten physicochemical characteristics, residue pairwise potential (Pp) at the interface, residue conservation (Rc), temperature factor (Tf ), and 49 structure features based on accessible surface area (ASA), depth index (DI) and protrusion index (PI)

  • Case studies To further illustrate the effectiveness of our approach APIS for identifying hot spot residues, we present two examples that are predicted by APIS, MINERVA and KFC using VMD software [54]

Read more

Summary

Introduction

It is well known that most of the binding free energy of protein interaction is contributed by a few key hot spot residues. These residues are crucial for understanding the function of proteins and studying their interactions. A database collecting such experimental hot spots is Alanine Scanning Energetics database (ASEdb) [10] Another database, i.e., binding interface database (BID), contains experimentally verified hot spots in protein-protein binding interfaces extracted from the literature [11]. Bogan and Thorn reported that hot spots are enriched in Tyr, Trp and Arg due to their size and conformation They found that hot spots are surrounded by energetically less important residues that shape like an O-ring to occlude bulk water molecules from the hot spots [12,13]

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.