Abstract

Studying the network of protein-protein interactions (PPIs) will provide valuable insights into the inner workings of cells. It is vitally important to develop an automated, high-throughput tool that efficiently predicts protein-protein interactions. This study proposes a new model for PPI prediction based on the concept of chaos game representation and the wavelet transform, which means that a considerable amount of sequence-order effects can be incorporated into a set of discrete numbers. The advantage of using chaos game representation and the wavelet transform to formulate the protein sequence is that it can more effectively reflect its overall sequence-order characteristics than the conventional correlation factors. Using such a formulation frame to represent the protein sequences means that the random forest algorithm can be used to conduct the prediction. The results for a large-scale independent test dataset show that the proposed model can achieve an excellent performance with an accuracy value of about 0.86 and a geometry mean value of about 0.85. The model is therefore a useful supplementary tool for PPI predictions. The predictor used in this article is freely available at http://www.jci-bioinfo.cn/PPI.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call