Residue Contact Prediction Research Articles

Protein-protein interactions play an important role in life activities. Analysis at a finer granularity, at the residue and atom level, helps us better understand the mechanisms of inter-protein interaction and supports drug design. Ongoing work in the field includes developing efficient computational methods that reduce trial and error and assist experimental researchers in determining complex structures. Trimeric protein interfaces, especially those of homotrimers, have rarely been studied. In this paper, we propose an interpretable machine learning method for predicting interface residue pairs in homo-trimeric proteins. Structure, sequence, and physicochemical information are integrated as input features for model training. A graph model represents intra-protein spatial information. Matrix factorization captures the interactions between different features. A kernel function is designed to automatically acquire information about the neighbors of the target residue pairs. The model achieves an accuracy of 54.5% on an independent test set. Sequence and structure alignments exhibit the model's ability to learn on its own. Our model indicates a biologically significant relationship between sequence and structure, and could help reduce trial and error in protein complex determination, protein-protein docking, and related fields.

Significance: Protein complex structures are important for understanding protein function and for designing functional proteins. As data have accumulated, computational tools have been developed for protein complex residue contact prediction, one of the most important steps in complex structure prediction. For homo-trimeric proteins, however, sequence-based deep learning predictors are infeasible for homologous sequences, and the black-box nature of these algorithms prevents us from understanding each step of their operation. We therefore propose an interpretable machine learning method for predicting residue-residue interactions at homo-trimeric protein interfaces, and the predictor performs well. Our work provides a computational aid for determining homo-trimeric protein interface residue pairs, which will be further verified by wet-lab experiments, and supports downstream tasks such as protein-protein docking, protein complex structure prediction, and drug design.

Protein contact prediction helps reconstruct the tertiary structure that largely determines a protein’s function; contact prediction from sequence is therefore an important problem. There has been exciting recent progress on this problem, but many existing methods still have low prediction accuracy. In this paper, we present a new mixed integer linear programming (MILP)-based consensus method: a Consensus scheme based On a Mixed integer linear opTimization method for prOtein contact Prediction (COMTOP). The method combines the strengths of seven selected protein contact prediction methods (CCMpred, EVfold, DeepCov, NNcon, PconsC4, plmDCA, and PSICOV) by optimizing the number of correctly predicted contacts, thereby achieving better prediction accuracy. The proposed hybrid protein residue–residue contact prediction scheme was tested on four independent test sets. For 239 highly non-redundant proteins, the method showed prediction accuracies of 59.68%, 70.79%, 78.86%, 89.04%, 94.51%, and 97.35% for top-5L, top-3L, top-2L, top-L, top-L/2, and top-L/5 contacts, respectively. On the CASP13 and CASP14 test sets, the proposed method obtained accuracies of 75.91% and 77.49%, respectively, for top-L/5 predictions. COMTOP was further tested on 57 non-redundant α-helical transmembrane proteins and achieved prediction accuracies of 64.34% and 73.91% for top-L/2 and top-L/5 predictions, respectively. For all test datasets, COMTOP's accuracy improvement over the seven individual methods grew as the number of predicted contacts increased. For example, the gain was much larger for large numbers of predicted contacts (such as top-5L and top-3L) than for small numbers (such as top-L/2 and top-L/5). The results and analysis demonstrate that COMTOP can significantly improve on the performance of the individual methods and is therefore more robust across different types of test sets. COMTOP also produced predictions better than or comparable to those of state-of-the-art predictors.
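The top-5L through top-L/5 figures quoted above are precision values over the highest-scoring predicted residue pairs, where L is the sequence length. The following is a minimal sketch of how such a figure is commonly computed, not the evaluation code used for COMTOP. The function names `top_k_precision` and `top_l_precision`, the symmetric score-matrix input, and the minimum sequence separation of 6 are all assumptions made for illustration; the abstract does not state which separation threshold was used.

```python
import numpy as np

def top_k_precision(scores, contacts, k, min_sep=6):
    """Fraction of the k highest-scoring residue pairs that are true contacts.

    scores   : (L, L) array of predicted contact scores (assumed symmetric)
    contacts : (L, L) boolean array of true contacts
    min_sep  : minimum sequence separation j - i (an assumed default)
    """
    scores = np.asarray(scores, dtype=float)
    contacts = np.asarray(contacts, dtype=bool)
    n = scores.shape[0]
    # Upper-triangle pairs (i, j) with j - i >= min_sep, each pair counted once.
    i, j = np.triu_indices(n, k=min_sep)
    if k < 1 or i.size == 0:
        raise ValueError("need k >= 1 and at least one eligible residue pair")
    # Indices of the k largest scores among the eligible pairs.
    order = np.argsort(scores[i, j])[::-1][:k]
    return float(contacts[i[order], j[order]].mean())

def top_l_precision(scores, contacts, factor, min_sep=6):
    """Precision for the top (factor * L) pairs, e.g. factor=0.2 for top-L/5."""
    n = np.asarray(scores).shape[0]
    k = max(1, int(n * factor))
    return top_k_precision(scores, contacts, k, min_sep=min_sep)
```

Under these assumptions, `top_l_precision(scores, contacts, 0.2)` would correspond to a top-L/5 figure and `top_l_precision(scores, contacts, 5)` to a top-5L figure for a single protein; dataset-level numbers are typically averages over proteins.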

Articles published on Residue Contact Prediction

Improving AlphaFold Predicted Contacts for Alpha-Helical Transmembrane Proteins Using Structural Features.

The Relative Distance Prediction of Transmembrane Protein Surface Residue Based on Improved Residual Networks

Protein Residue Contact Prediction Based on Deep Learning and Massive Statistical Features from Multi-Sequence Alignment

Applications of residue contact predictions in structural biology

An interpretable machine learning method for homo-trimeric protein interface residue-residue interaction prediction

COMTOP: Protein Residue-Residue Contact Prediction through Mixed Integer Linear Optimization.

Evaluation of residue-residue contact prediction methods: From retrospective to prospective.

ModFOLD8: accurate global and local quality estimates for 3D protein models.

Protein contact map refinement for improving structure prediction using generative adversarial networks.

Fold recognition by scoring protein maps using the congruence coefficient.

DeepHelicon: Accurate prediction of inter-helical residue contacts in transmembrane proteins by residual neural networks

SSCpred: Single-Sequence-Based Protein Contact Prediction Using Deep Fully Convolutional Network.

Protein contact prediction using metagenome sequence data and residual neural networks.

BetaDL: A protein beta-sheet predictor utilizing a deep learning model and independent set solution

Bio-knowledge-based filters improve residue-residue contact prediction accuracy.

Identification of residue pairing in interacting β-strands from a predicted residue contact map

Three-body interactions improve contact prediction within direct-coupling analysis.

MemBrain-contact 2.0: a new two-stage machine learning model for the prediction enhancement of transmembrane protein residue contacts in the full chain.

Predicting accurate contacts in thousands of Pfam domain families using PconsC3.

A deep learning framework for improving long-range residue-residue contact prediction using a hierarchical strategy.
