Abstract

Soluble N-ethylmaleimide sensitive factor activating protein receptor (SNARE) proteins are a large family of transmembrane proteins located in organelles and vesicles. The important roles of SNARE proteins include initiating the vesicle fusion process and activating and fusing proteins as they undergo exocytosis activity, and SNARE proteins are also vital for the transport regulation of membrane proteins and non-regulatory vesicles. Therefore, there is great significance in establishing a method to efficiently identify SNARE proteins. However, the identification accuracy of the existing methods such as SNARE CNN is not satisfied. In our study, we developed a method based on a support vector machine (SVM) that can effectively recognize SNARE proteins. We used the position-specific scoring matrix (PSSM) method to extract features of SNARE protein sequences, used the support vector machine recursive elimination correlation bias reduction (SVM-RFE-CBR) algorithm to rank the importance of features, and then screened out the optimal subset of feature data based on the sorted results. We input the feature data into the model when building the model, used 10-fold crossing validation for training, and tested model performance by using an independent dataset. In independent tests, the ability of our method to identify SNARE proteins achieved a sensitivity of 68%, specificity of 94%, accuracy of 92%, area under the curve (AUC) of 84%, and Matthew’s correlation coefficient (MCC) of 0.48. The results of the experiment show that the common evaluation indicators of our method are excellent, indicating that our method performs better than other existing classification methods in identifying SNARE proteins.

Highlights

  • N-ethylmaleimide sensitive factor (NSF) (Whiteheart et al, 2001) protein and soluble NSF attachment proteins (SNAPS) (Whiteheart et al, 1993) are two essential factors for protein transport between membranes (Hohl et al, 1998) (Hanson et al, 1997)

  • At the molecular level, when the transport vesicle is close to the target membrane, syntaxin1A/B on the target membrane receives a signal to recognize, approach and combine with SNAP25, which is located on the target membrane

  • We developed a method to recognize sensitive factor activating protein receptor (SNARE) proteins based on position-specific scoring matrix (PSSM) (Chou and Shen, 2007; Liu et al, 2019b; Hong et al, 2020a; Hong et al, 2020b) profiles and support vector machine (SVM)

Read more

Summary

Introduction

N-ethylmaleimide sensitive factor (NSF) (Whiteheart et al, 2001) protein and soluble NSF attachment proteins (SNAPS) (Whiteheart et al, 1993) are two essential factors for protein transport between membranes (Hohl et al, 1998) (Hanson et al, 1997) They were first discovered as essential proteins for protein transport from donor to receptor subcellular structures during the processes of Golgi modification and secretion. VAMP2 (q-snare) on the transport vesicle recognizes (Kweon et al, 2003), draws close to and binds to form a 7S R-Q-SNARE complex, which guides the attachment and fusion of the transport vesicle and the target membrane, leading to the secretion of substances in the transport vesicle into the new subcellular structure or out of the cell through exocytosis, completing the intracellular transport and extracellular exocytosis and secretion processes

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call