CoRNeA: A Pipeline to Decrypt the Inter-Protein Interfaces from Amino Acid Sequence Information.

Kriti Chopra,Ajit Kembhavi,Kaushal Sharma,Radha Chauhan,Shekhar C Mande,Bhawna Burdak

doi:10.3390/biom10060938

Abstract

Decrypting the interface residues of the protein complexes provides insight into the functions of the proteins and, hence, the overall cellular machinery. Computational methods have been devised in the past to predict the interface residues using amino acid sequence information, but all these methods have been majorly applied to predict for prokaryotic protein complexes. Since the composition and rate of evolution of the primary sequence is different between prokaryotes and eukaryotes, it is important to develop a method specifically for eukaryotic complexes. Here, we report a new hybrid pipeline for predicting the protein-protein interaction interfaces in a pairwise manner from the amino acid sequence information of the interacting proteins. It is based on the framework of Co-evolution, machine learning (Random Forest), and Network Analysis named CoRNeA trained specifically on eukaryotic protein complexes. We use Co-evolution, physicochemical properties, and contact potential as major group of features to train the Random Forest classifier. We also incorporate the intra-contact information of the individual proteins to eliminate false positives from the predictions keeping in mind that the amino acid sequence of a protein also holds information for its own folding and not only the interface propensities. Our prediction on example datasets shows that CoRNeA not only enhances the prediction of true interface residues but also reduces false positive rates significantly.

Highlights

The biological machinery performs its cellular functions when its basic units, such as DNA, RNA, and proteins, interact with each other
The other features derived for the Random Forest classifier are based on the physicochemical properties of the amino acids which depend on their side chain structure, such as charge, size and hydrophobe compatibility, secondary structure information, and relative solvent accessibility, were derived using amino acid sequence information
Random Forest classifier is a tree-structure based algorithm where the classification rules are learned based on the feature values and their target class provided while training

Summary

Introduction

The biological machinery performs its cellular functions when its basic units, such as DNA, RNA, and proteins, interact with each other. There are various experimental methods known for examining these interactions such as yeast two hybrid (Y2H) [1], co-immunoprecipitation (co-IP) [2], mass spectrometry [3], etc., which provide information only about the domains necessary for maintaining the interaction or the proximity of the interactions. These methods are labor, cost and time intensive. Deciphering the PPII (Protein-Protein Interaction Interfaces) at the highest resolution through x-ray crystallography or cryo-electron microscopy methods is even more challenging due to their intrinsic technical difficulties

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Biomolecules	Publication Date: Jun 22, 2020
Citations: 6	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

CoRNeA: A Pipeline to Decrypt the Inter-Protein Interfaces from Amino Acid Sequence Information.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Biomolecules

Lead the way for us

Similar Papers

Mass spectrometry-based proteomic analysis of the epitope-tag affinity purified protein complexes in eukaryotes
Ing-Feng Chang
PROTEOMICS | VOL. 6
Ing-Feng ChangIng-Feng Chang
30 Oct 2006
PROTEOMICS | VOL. 6

Cotranslational assembly of protein complexes in eukaryotes revealed by ribosome profiling
Ayala Shiber ... Mostafa Zedan
Nature | VOL. 561
Ayala Shiber, et. al.Ayala Shiber ... Mostafa Zedan
29 Aug 2018
Nature | VOL. 561

Structure-based prediction of protein-protein interaction sites
...
-
, et. al. ...
31 Oct 2012
31 Oct 2012

Predicting DNA-Binding Residues of Proteins Using Random Forest and Evolutionary Information Combined with Conservation Information
Xin Ma ... Jian-Ming Xie
-
Xin Ma, et. al.Xin Ma ... Jian-Ming Xie
01 May 2011
01 May 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CoRNeA: A Pipeline to Decrypt the Inter-Protein Interfaces from Amino Acid Sequence Information.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Biomolecules