Improved protein-protein interactions prediction via weighted sparse representation model combining continuous wavelet descriptor and PseAA composition.

Yu-An Huang,Zhu-Hong You,Xing Chen,Gui-Ying Yan

doi:10.1186/s12918-016-0360-6

Open Access

Abstract

Background: Protein-protein interactions (PPIs) are essential to most biological processes. As bioscience has entered the era of the genome and proteome, demand for knowledge of PPI networks is growing. High-throughput biological technologies can identify new PPIs, but they are expensive, time-consuming, and tedious, so computational methods for predicting PPIs play an important role. In recent years, an increasing number of computational methods, such as protein structure-based approaches, have been proposed for predicting PPIs. The major limitation of these methods in principle is that they require prior information about the proteins to infer PPIs. It is therefore important to develop computational methods that use only the information in protein amino acid sequences.

Results: Here, we report a highly efficient approach for predicting PPIs. The main improvements come from a novel protein sequence representation that combines a continuous wavelet descriptor with Chou’s pseudo amino acid composition (PseAAC), and from adopting a weighted sparse representation based classifier (WSRC). Cross-validated on the PPI datasets of Saccharomyces cerevisiae, Human and H. pylori, the method achieves accuracies as high as 92.50%, 95.54% and 84.28%, respectively, significantly better than previously proposed methods. Extensive experiments are performed to compare the proposed method with the state-of-the-art Support Vector Machine (SVM) classifier.

Conclusions: The outstanding results yielded by our model indicate that the proposed feature extraction method, which combines two kinds of descriptors, has strong expressive ability and is expected to provide comprehensive and effective information for machine learning-based classification models. In addition, the prediction performance in the comparison experiments shows that the combined feature and WSRC work well together. Thus, the proposed method is a very efficient method for predicting PPIs and may be a useful supplementary tool for future proteomics studies.
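
The classification idea behind a weighted sparse representation based classifier can be illustrated with a small sketch. It is not the authors' implementation. It assumes a locality weight of the form 1 - exp(-d²/2σ²), a weighted L1 penalty solved by iterative soft thresholding (ISTA), and hypothetical names (`wsrc_predict`, `sigma`, `lam`, `iters`). None of these details are stated in the abstract. A test sample is coded as a sparse combination of training samples, and the class whose samples reconstruct it with the smallest residual is returned.

```python
import math

def soft_threshold(v, t):
    """Shrink v toward zero by t (proximal operator of t*|v|)."""
    return math.copysign(max(abs(v) - t, 0.0), v)

def wsrc_predict(train, labels, y, sigma=1.0, lam=0.01, iters=500):
    """Label y by class-wise reconstruction residual of a weighted sparse code.

    train  : list of feature vectors (one per training sample, not all zero)
    labels : class label of each training sample
    y      : feature vector to classify
    """
    # Locality weights (assumed form): distant samples get a larger L1 penalty.
    weights = []
    for x in train:
        d2 = sum((a - b) ** 2 for a, b in zip(x, y))
        weights.append(1.0 - math.exp(-d2 / (2.0 * sigma ** 2)) + 1e-3)

    # Step size from a safe upper bound on the gradient's Lipschitz constant.
    step = 1.0 / sum(v * v for x in train for v in x)

    coef = [0.0] * len(train)
    for _ in range(iters):
        # Residual of the current reconstruction: sum_i coef_i * x_i - y.
        recon = [sum(c * x[j] for c, x in zip(coef, train)) for j in range(len(y))]
        resid = [r - t for r, t in zip(recon, y)]
        # Gradient step on the squared error, then weighted soft thresholding.
        coef = [
            soft_threshold(
                c - step * sum(xj * rj for xj, rj in zip(x, resid)),
                step * lam * w,
            )
            for c, x, w in zip(coef, train, weights)
        ]

    # Pick the class whose own samples reconstruct y with the smallest error.
    best_label, best_err = None, float("inf")
    for label in sorted(set(labels)):
        recon = [
            sum(c * x[j] for c, x, l in zip(coef, train, labels) if l == label)
            for j in range(len(y))
        ]
        err = math.sqrt(sum((a - b) ** 2 for a, b in zip(recon, y)))
        if err < best_err:
            best_label, best_err = label, err
    return best_label

# Toy usage: label 1 stands for "interacting", 0 for "non-interacting".
train = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
labels = [1, 1, 0, 0]
print(wsrc_predict(train, labels, [0.95, 0.05]))
```

In practice the feature vectors would be the combined descriptors of a protein pair, and a dedicated L1 solver would replace the plain ISTA loop for anything beyond toy sizes.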

Highlights

Protein-protein interactions (PPIs) are essential to most biological processes
We report a novel computational method for predicting protein-protein interactions from amino acid sequences, using a weighted sparse representation based classifier (WSRC) and combined features consisting of CW-Local binary pattern (LBP) and pseudo amino acid composition (PseAAC) descriptors
In the proposed model, protein features are extracted by transforming sequences into numerical sequences and then applying the continuous wavelet transform and Local Binary Pattern Histogram Fourier. This feature extraction method rests mainly on the assumption that protein sequences carry enough information for predicting protein-protein interactions, and on the fact that the hydrophobicity of a protein influences the interaction process
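
The feature pipeline described in the highlights (numerical sequence, continuous wavelet transform, local binary pattern) can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' code. The Kyte-Doolittle hydropathy scale, the Mexican hat (Ricker) mother wavelet, the chosen scales, and the function names are all assumptions. The final step is a basic 8-neighbour LBP histogram, and the Fourier step of Local Binary Pattern Histogram Fourier is omitted.

```python
import math

# Kyte-Doolittle hydropathy index (assumed scale; the source does not name one).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def encode(sequence):
    """Map residues to hydrophobicity values, skipping unknown symbols."""
    return [KYTE_DOOLITTLE[aa] for aa in sequence.upper() if aa in KYTE_DOOLITTLE]

def ricker(t):
    """Mexican hat mother wavelet (assumed choice of wavelet)."""
    return (2.0 / (math.sqrt(3.0) * math.pi ** 0.25)) * (1.0 - t * t) * math.exp(-t * t / 2.0)

def cwt(signal, scales):
    """Return a len(scales) x len(signal) matrix of wavelet coefficients."""
    rows = []
    for s in scales:
        norm = 1.0 / math.sqrt(s)
        rows.append([
            norm * sum(x * ricker((n - b) / s) for n, x in enumerate(signal))
            for b in range(len(signal))
        ])
    return rows

def lbp_histogram(matrix):
    """256-bin histogram of basic 8-neighbour LBP codes over interior cells."""
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]
    hist = [0] * 256
    for i in range(1, len(matrix) - 1):
        for j in range(1, len(matrix[0]) - 1):
            code = 0
            for bit, (di, dj) in enumerate(offsets):
                # Set the bit when the neighbour is at least as large as the centre.
                if matrix[i + di][j + dj] >= matrix[i][j]:
                    code |= 1 << bit
            hist[code] += 1
    return hist

# Toy usage on a short, made-up peptide.
coeffs = cwt(encode("MKVLAAGIW"), scales=[1, 2, 4])
features = lbp_histogram(coeffs)
print(len(coeffs), len(coeffs[0]), sum(features))
```

The histogram has a fixed length regardless of sequence length, which is what lets proteins of different lengths be compared by a classifier.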

Summary

Introduction

Protein-protein interactions (PPIs) are essential to most biological processes. As bioscience has entered the era of the genome and proteome, demand for knowledge of PPI networks is growing. It is important to develop computational methods that use only the information in protein amino acid sequences. In this post-genomic era, proteins, as a major component of organisms, are widely studied because of their important roles in most cell functions, including DNA transcription and replication, metabolic cycles, and signaling cascades. Considerable effort has been devoted to developing experimental methods for detecting PPIs and constructing protein interaction networks, such as yeast two-hybrid (Y2H) screens [1, 2], tandem affinity purification (TAP) [3], mass spectrometric protein complex identification (MS-PCI) [3] and other high-throughput biological techniques. To detect a larger fraction of the whole PPI network and to make use of the vast and valuable biological data provided by experimental methods, there is a growing need for computational methods capable of identifying PPIs.

Methods

Results

Discussion

Conclusion

Journal: BMC Systems Biology
Publication Date: Dec 1, 2016
Citations: 26
License type: cc-by


Similar Papers

Prediction of GABA A receptor proteins using the concept of Chou's pseudo-amino acid composition and support vector machine
Hassan Mohabatkar ... Abolghasem Esmaeili
Journal of Theoretical Biology | VOL. 281 | 28 Apr 2011

Using Weighted Sparse Representation Model Combined with Discrete Cosine Transformation to Predict Protein-Protein Interactions from Protein Sequence.
Yu-An Huang ... Lirong Wang
BioMed Research International | VOL. 2015 | 01 Jan 2015

Prediction of metalloproteinase family based on the concept of Chou’s pseudo amino acid composition using a machine learning approach
Majid Mohammad Beigi ... Hassan Mohabatkar
Journal of Structural and Functional Genomics | VOL. 12 | 01 Dec 2011

Prediction of β-lactamase and its class by Chou's pseudo-amino acid composition and support vector machine.
Ravindra Kumar ... Bandana Kumari
Journal of Theoretical Biology | VOL. 365 | 22 Oct 2014
