Abstract

Self-interacting proteins, whose two or more copies can interact with each other, play important roles in cellular functions and the evolution of protein interaction networks (PINs). Knowing whether a protein can self-interact can contribute to and sometimes is crucial for the elucidation of its functions. Previous related research has mainly focused on the structures and functions of specific self-interacting proteins, whereas knowledge on their overall properties is limited. Meanwhile, the two current most common high throughput protein interaction assays have limited ability to detect self-interactions because of biological artifacts and design limitations, whereas the bioinformatic prediction method of self-interacting proteins is lacking. This study aims to systematically study and predict self-interacting proteins from an overall perspective. We find that compared with other proteins the self-interacting proteins in the structural aspect contain more domains; in the evolutionary aspect they tend to be conserved and ancient; in the functional aspect they are significantly enriched with enzyme genes, housekeeping genes, and drug targets, and in the topological aspect tend to occupy important positions in PINs. Furthermore, based on these features, after feature selection, we use logistic regression to integrate six representative features, including Gene Ontology term, domain, paralogous interactor, enzyme, model organism self-interacting protein, and betweenness centrality in the PIN, to develop a proteome-wide prediction model of self-interacting proteins. Using 5-fold cross-validation and an independent test, this model shows good performance. Finally, the prediction model is developed into a user-friendly web service SLIPPER (SeLf-Interacting Protein PrEdictoR). Users may submit a list of proteins, and then SLIPPER will return the probability_scores measuring their possibility to be self-interacting proteins and various related annotation information. This work helps us understand the role self-interacting proteins play in cellular functions from an overall perspective, and the constructed prediction model may contribute to the high throughput finding of self-interacting proteins and provide clues for elucidating their functions.

Highlights

  • Self-interacting proteins are referred to as those whose two or more copies can interact with each other

  • Properties of Self-interacting Proteins—To understand the role that self-interacting proteins play in cellular functions from an overall perspective, we systematically analyzed their properties from multiple aspects

  • Self-interacting proteins are shown to be significantly enriched with enzyme genes and housekeeping genes, which are just consistent with our previous knowledge on the importance of homo-oligomerization in functional implementation of enzymes and more extensive cellular functions (Table II) [2]

Read more

Summary

Introduction

Self-interacting proteins are referred to as those whose two or more copies can interact with each other. The majority of the enzymes in the BRENDA database are self-interacting proteins [1], and the transition between different oligomeric states may regulate enzyme activity [2]. Many multiprotein complexes such as proteasome, ribosome, and nucleosome contain homodimers [3]. The ability to self-interact can confer several different structural and functional advantages to proteins, including improved stability, allosteric regulation, control over the accessibility and specificity of active sites, as well as increased complexity [1,2,3,4]. In 2010, Perez-Bercoff et al [6] found that genes encoding self-interacting proteins have higher duplication propensity

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call