Abstract

Membrane protein is an important type of proteins and has been confirmed to play essential roles in various cellular processes. Based on their intramolecular arrangements and positions in a cell, they can be categorized into several types. However, it is time- and cost-consuming to recognize the type of a given membrane protein via traditional biophysical methods. In view of this, several computational models have been proposed in recent years. Most models adopted various information of membrane proteins, such as their sequences, domain profiles, physiochemical properties, etc. to extract different features, which were fed into downstream classification algorithms. In this study, we built two novel prediction models, which incorporated novel feature extraction methods, i.e., network embedding methods. To this end, several protein networks were constructed using the protein-protein interaction information retrieved from STRING. Among these models, one model was constructed based on features obtained by applying Mashup on seven protein networks, another model was built using features yielded by Node2Vec on one comprehensive protein network. Each model adopted random forest as the classification algorithm and employed the Synthetic Minority Over-sampling Technique (SMOTE) to overcome the influence yielded by the great difference on sizes of different membrane protein types. Furthermore, two models were integrated into one model to improve the predicted quality. The test results shown that the integrated model had good performance and was superior to any individual model. Also, we compared our models with some previous models, suggesting that our models were competitive.

Highlights

  • Membrane protein is an important type of proteins that can interact with biological membranes

  • PERFORMANCE OF TWO INDIVIDUAL NETWORK MODELS ON THE TRAINING DATASET Two individual network models using random forest (RF) as the classification algorithm were proposed for predicting types of membrane proteins

  • Compared with precisions of RF integrated model (Figure 4(B)), support vector machine (SVM) integrated model generated lower precisions on all five types. All these results indicated that the RF integrated model was much superior to the SVM integrated model

Read more

Summary

Introduction

Membrane protein is an important type of proteins that can interact with biological membranes. They play various roles in cells biology, such as transporting substances to help them enter and leave cells, transmitting signals as specific receptors for receiving hormones or other chemicals, etc. Knowing the type of a given membrane protein is very helpful to infer its function because it has been confirmed that the membrane protein type is highly related to its function [3]. It is very meaningful to determine the types of membrane proteins. Such determination via traditional experiments is time- and cost-consuming.

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.