Abstract

Protein–protein interactions (PPIs) in plants play an essential role in the regulation of biological processes. However, traditional experimental methods are expensive, time-consuming, and need sophisticated technical equipment. These drawbacks motivated the development of novel computational approaches to predict PPIs in plants. In this article, a new deep learning framework, which combined the discrete Hilbert transform (DHT) with deep neural networks (DNN), was presented to predict PPIs in plants. To be more specific, plant protein sequences were first transformed as a position-specific scoring matrix (PSSM). Then, DHT was employed to capture features from the PSSM. To improve the prediction accuracy, we used the singular value decomposition algorithm to decrease noise and reduce the dimensions of the feature descriptors. Finally, these feature vectors were fed into DNN for training and predicting. When performing our method on three plant PPI datasets Arabidopsis thaliana, maize, and rice, we achieved good predictive performance with average area under receiver operating characteristic curve values of 0.8369, 0.9466, and 0.9440, respectively. To fully verify the predictive ability of our method, we compared it with different feature descriptors and machine learning classifiers. Moreover, to further demonstrate the generality of our approach, we also test it on the yeast and human PPI dataset. Experimental results anticipated that our method is an efficient and promising computational model for predicting potential plant–protein interacted pairs.

Highlights

  • Identification of protein–protein interactions (PPIs) in plants is essential for exploring the mechanisms underlying of biological processes, such as organ formation, homeostasis control (Canovas et al, 2004), plant defense (Zhang et al, 2010), signal transduction (Khan and Kihara, 2016), and stress response (Bracha-Drori et al, 2004)

  • discrete cosine transform (DCT)+deep neural networks (DNN) fast Fourier transform (FFT)+DNN discrete wavelet transform (DWT)+DNN AC+DNN Our method be randomly split into five equal parts; four of them will be represented by employed for training and the remaining one was used for testing

  • To further verify the predictive ability of DNN classifier, we compared it with the K-nearest neighbor (KNN) and random forest (RF) model by the five-fold CV scheme and adopted the same discrete Hilbert transform (DHT) feature descriptors

Read more

Summary

Introduction

Identification of protein–protein interactions (PPIs) in plants is essential for exploring the mechanisms underlying of biological processes, such as organ formation, homeostasis control (Canovas et al, 2004), plant defense (Zhang et al, 2010), signal transduction (Khan and Kihara, 2016), and stress response (Bracha-Drori et al, 2004). Predicting PPI in Plants mass spectrometry (Fukao, 2012; Armean et al, 2013) and yeast two-hybrid (Causier and Davies, 2002; Fang et al, 2002), these approaches are cumbersome, costly, time consuming, and always suffer from high false positive rate. To overcome these problems, there is an urgent need to develop sequence-based computational methods that can accurately predict potential PPIs while analyzing the functions of plant genes. It is meaningful to develop computational methods to predict potential PPIs from sequence information

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call