A Novel Ensemble Learning-Based Computational Method to Predict Protein-Protein Interactions from Protein Primary Sequences.

Jie Pan,Shiwei Wang,Yanmei Sun,Liping Li,Zhuhong You,Changqing Yu

doi:10.3390/biology11050775

Abstract

Simple SummaryProtein–protein interactions (PPIs) play a central role in the evolution and progression of various biological processes. In this article, we constructed a novel ensemble-learning-based model to predict potential PPIs, which only utilized the protein sequence information. The presented method used Discrete Hilbert transform to extract amino acid sequence information from position-specific scoring matrices. Then these extracted features were fed into rotation forest for training and predicting. When applying our method to the three datasets (Yeast, Human, and Oryza sativa) for detecting PPIs, we obtained excellent prediction performance. Furthermore, the comparison results indicated that our computational model is effective and robust in predicting potential PPI pairs.Protein–protein interactions (PPIs) are crucial for understanding the cellular processes, including signal cascade, DNA transcription, metabolic cycles, and repair. In the past decade, a multitude of high-throughput methods have been introduced to detect PPIs. However, these techniques are time-consuming, laborious, and always suffer from high false negative rates. Therefore, there is a great need of new computational methods as a supplemental tool for PPIs prediction. In this article, we present a novel sequence-based model to predict PPIs that combines Discrete Hilbert transform (DHT) and Rotation Forest (RoF). This method contains three stages: firstly, the Position-Specific Scoring Matrices (PSSM) was adopted to transform the amino acid sequence into a PSSM matrix, which can contain rich information about protein evolution. Then, the 400-dimensional DHT descriptor was constructed for each protein pair. Finally, these feature descriptors were fed to the RoF classifier for identifying the potential PPI class. When exploring the proposed model on the Yeast, Human, and Oryza sativa PPIs datasets, it yielded excellent prediction accuracies of 91.93, 96.35, and 94.24%, respectively. In addition, we also conducted numerous experiments on cross-species PPIs datasets, and the predictive capacity of our method is also very excellent. To further access the prediction ability of the proposed approach, we present the comparison of RoF with four powerful classifiers, including Support Vector Machine (SVM), Random Forest (RF), K-nearest Neighbor (KNN), and AdaBoost. We also compared it with some existing superiority works. These comprehensive experimental results further confirm the excellent and feasibility of the proposed approach. In future work, we hope it can be a supplemental tool for the proteomics analysis.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Biology	Publication Date: May 19, 2022
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Novel Ensemble Learning-Based Computational Method to Predict Protein-Protein Interactions from Protein Primary Sequences.

Abstract

Talk to us

Similar Papers

More From: Biology

Lead the way for us

Similar Papers

A CASE-BASED REASONING SYSTEM FOR GENOTYPIC PREDICTION OF HIV-1 CO-RECEPTOR TROPISM
Mark C Evans ... Matthew Bidwell Goetz
Journal of Bioinformatics and Computational Biology | VOL. 11
Mark C Evans, et. al.Mark C Evans ... Matthew Bidwell Goetz
16 Jul 2013
Journal of Bioinformatics and Computational Biology | VOL. 11

Performance of rotation forest ensemble classifier and feature extractor in predicting protein interactions using amino acid sequences
Alhadi Bustamam ... Patuan P Tampubolon
BMC Genomics | VOL. 20
Alhadi Bustamam, et. al.Alhadi Bustamam ... Patuan P Tampubolon
01 Dec 2019
BMC Genomics | VOL. 20

Detection of congestive heart failure from short-term heart rate variability segments using hybrid feature selection approach
Alan Jovic ... Goran Krstacic
Biomedical Signal Processing and Control | VOL. 53
Alan Jovic, et. al.Alan Jovic ... Goran Krstacic
18 Jun 2019
Biomedical Signal Processing and Control | VOL. 53

Robust and accurate prediction of protein\u2013protein interactions by exploiting evolutionary information
Yang Li ... Li-Ping Li
Scientific Reports | VOL. 11
Yang Li, et. al.Yang Li ... Li-Ping Li
19 Aug 2021
Robust and accurate prediction of protein\u2013protein interactions by exploiting evolutionary information
Yang Li ... Li-Ping Li

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Ensemble Learning-Based Computational Method to Predict Protein-Protein Interactions from Protein Primary Sequences.

Abstract

Talk to us

Similar Papers

More From: Biology