Abstract

As the pathogen of malaria, malaria parasite secretes a variety of proteins for its growth and reproduction. The identification of the secretory proteins of malaria parasite has crucial reference significance for the development of anti-malaria vaccines as well as medicine. In this study, a computational classification method was developed to identify the secreted proteins of Plasmodium. Amino acid composition, dipeptide composition, and tripeptide composition as well as reduced amino acids alphabets were proposed to illuminate protein sequences. We further used SVM to train and predict respectively and optimized the features. 74 types of reduced amino acids alphabets were employed to predict secretory proteins. The results showed that the accuracy improved to 91.67% with 0.84 Mathew's correlation coefficient (MCC) by dipeptide composition, and the highest prediction accuracy reached 92.26% after feature selection, which demonstrated that our method is prominent and reliable in the field of malaria parasite secreted proteins prediction. A intuitive web server iSP-RAAC (http://bioinfor.imu.edu.cn/isppseraac) was established for the convenience of most experimental scientists.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call