Abstract

BackgroundEssential proteins are necessary for the survival and development of cells. The identification of essential proteins can help to understand the minimal requirements for cellular life and it also plays an important role in the disease genes study and drug design. With the development of high-throughput techniques, a large amount of protein-protein interactions data is available to predict essential proteins at the network level. Hitherto, even though a number of essential protein discovery methods have been proposed, the prediction precision still needs to be improved.MethodsIn this paper, we propose a new algorithm, improved Flower Pollination algorithm (FPA) for identifying Essential proteins, named FPE. Different from other existing essential protein discovery methods, we apply FPA which is a new intelligent algorithm imitating pollination behavior of flowering plants in nature to identify essential proteins. Analogous to flower pollination is to find optimal reproduction from the perspective of biological evolution, and the identification of essential proteins is to discover a candidate essential protein set by analyzing the corresponding relationships between FPA algorithm and the prediction of essential proteins, and redefining the positions of flowers and specific pollination process. Moreover, it has been proved that the integration of biological and topological properties can get improved precision for identifying essential proteins. Consequently, we develop a GSC measurement in order to judge the essentiality of proteins, which takes into account not only the Gene expression data, Subcellular localization and protein Complexes information, but also the network topology.ResultsThe experimental results show that FPE performs better than the state-of-the-art methods (DC, SC, IC, EC, LAC, NC, PeC, WDC, UDoNC and SON) in terms of the prediction precision, precision-recall curve and jackknife curve for identifying essential proteins and also has high stability.ConclusionsWe confirm that FPE can be used to effectively identify essential proteins by the use of nature-inspired algorithm FPA and the combination of network topology with gene expression data, subcellular localization and protein complexes information. The experimental results have shown the superiority of FPE for the prediction of essential proteins.

Highlights

  • Essential proteins are necessary for the survival and development of cells

  • In FPE, first the Q proteins with the highest degree in the protein-protein interaction (PPI) network are selected as initial position of pollen to improve efficiency of FPE algorithm and using a perturbance factor that is a constant between [0, 1] to make sure that each pollen is different

  • We redefine the update rules of the pollen’ s position and each pollen is updated by tailing the global optimal solution in each iteration since the global optimum can be viewed as a reliable guide for pollen to search better solution

Read more

Summary

Introduction

The identification of essential proteins can help to understand the minimal requirements for cellular life and it plays an important role in the disease genes study and drug design. Essential proteins are indispensable in the cellular life for the survival or development of an organism. Studies have shown that essential proteins are related to disease genes [2] and contribute to the prediction of drug targets [3]. Identifying essential proteins is conducive to the understanding of minimal requirements for cellular life, and important for the study of disease genes [4]. With the rapid development of high-throughput technologies, such as yeast two-hybrid screens [8], tandem affinity purification [9] and mass spectrometric analysis [10], a large amount of protein interaction data is detected, which provide new possibilities for the identification of essential proteins. It is becoming increasingly important to predict essential proteins by computational methods based on protein interaction data

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.