COMTOP: Protein Residue-Residue Contact Prediction through Mixed Integer Linear Optimization.

Md Selim Reza, Yanjie Wei, Shengzhong Feng, Md Tofazzal Hossain, Langxi Jin, Huiling Zhang

doi:10.3390/membranes11070503

Abstract

Protein contact prediction helps reconstruct the tertiary structure that largely determines a protein’s function; therefore, contact prediction from sequence is an important problem. There has been exciting recent progress on this problem, but many existing methods still have low prediction accuracy. In this paper, we present a new mixed integer linear programming (MILP)-based consensus method: a Consensus scheme based On a Mixed integer linear opTimization method for prOtein contact Prediction (COMTOP). The MILP-based consensus method combines the strengths of seven selected protein contact prediction methods (CCMpred, EVfold, DeepCov, NNcon, PconsC4, plmDCA, and PSICOV) by optimizing the number of correctly predicted contacts, thereby achieving better prediction accuracy. The proposed hybrid protein residue–residue contact prediction scheme was tested on four independent test sets. For 239 highly non-redundant proteins, the method showed a prediction accuracy of 59.68%, 70.79%, 78.86%, 89.04%, 94.51%, and 97.35% for top-5L, top-3L, top-2L, top-L, top-L/2, and top-L/5 contacts, respectively. When tested on the CASP13 and CASP14 test sets, the proposed method obtained accuracies of 75.91% and 77.49% for top-L/5 predictions, respectively. COMTOP was further tested on 57 non-redundant α-helical transmembrane proteins and achieved prediction accuracies of 64.34% and 73.91% for top-L/2 and top-L/5 predictions, respectively. For all test datasets, the improvement of COMTOP in accuracy over the seven individual methods increased with the number of predicted contacts. For example, the improvement was much larger for large numbers of predicted contacts (such as top-5L and top-3L) than for small numbers (such as top-L/2 and top-L/5). The results and analysis demonstrate that COMTOP can significantly improve on the performance of the individual methods and is more robust across different types of test sets. COMTOP also showed better or comparable predictions when compared with state-of-the-art predictors.
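The top-L/k figures quoted above can be illustrated with a small calculation. The sketch below assumes the common convention that top-(factor × L) accuracy is the fraction of the highest-scoring factor × L predicted residue pairs that are true contacts, where L is the sequence length; the abstract does not spell out the exact definition used. The function name `top_k_accuracy` and the toy scores are hypothetical and are not taken from the paper.

```python
from typing import Dict, Set, Tuple

Pair = Tuple[int, int]  # residue index pair (i, j) with i < j


def top_k_accuracy(
    scores: Dict[Pair, float],
    true_contacts: Set[Pair],
    seq_len: int,
    factor: float,
) -> float:
    """Fraction of the top (factor * seq_len) scored pairs that are true contacts.

    Assumed convention: factor=0.2 corresponds to top-L/5, factor=5 to top-5L.
    """
    n_top = max(1, int(seq_len * factor))
    # Rank pairs by predicted contact score, highest first, and keep n_top of them.
    ranked = sorted(scores, key=scores.get, reverse=True)[:n_top]
    if not ranked:
        return 0.0
    hits = sum(1 for pair in ranked if pair in true_contacts)
    return hits / len(ranked)


# Toy example (made-up scores for a hypothetical protein of length 10).
toy_scores = {(0, 5): 0.9, (1, 7): 0.8, (2, 9): 0.7, (3, 8): 0.2, (0, 9): 0.1}
toy_truth = {(0, 5), (2, 9), (3, 8)}

top_l5 = top_k_accuracy(toy_scores, toy_truth, seq_len=10, factor=0.2)
top_l2 = top_k_accuracy(toy_scores, toy_truth, seq_len=10, factor=0.5)
print(top_l5, top_l2)
```

In this toy case the two highest-scoring pairs contain one true contact, and the five highest-scoring pairs contain three.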

Highlights

Protein contact prediction aims at predicting which residues of a protein are in contact.
Two non-local residues are far away from each other in the protein primary structure, but they are close to each other in the 3D structure.
When tested on CASP13, CASP14, and 57 non-redundant TM proteins, the consensus method achieved accuracies of 75.91%, 77.49%, and 73.91%, respectively, for top-L/5 predictions, outperforming the seven individual methods and reaching state-of-the-art prediction performance.
We presented a novel hybrid consensus method named COMTOP.

Summary

Introduction

Protein contact prediction aims at predicting which residues of a protein are in contact. Two non-local residues are far away from each other in the protein primary structure, but they are close to each other in the 3D structure. A protein contact map is a 2D representation of a protein’s 3D structure. Contact map information can be used as distance restraints to guide protein structure modeling [4,5,6,7,8,9,10]. This creates a new direction for solving the grand challenge of de novo protein structure prediction. Although the idea of predicting residue–residue contacts and using them to predict 3D models was introduced around two decades ago [11,12], its realization has only recently gained much attention from the community and come into practice, as many authors have shown that residue contacts can be predicted with reasonable accuracy [13,14,15,16,17,18,19,20].
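To make the notion of a contact map concrete, the following minimal sketch builds a binary map from 3D coordinates. It assumes one representative atom per residue, a distance cutoff of 8 Å, and a minimum sequence separation of 6 residues to exclude local pairs; these values are widely used conventions but are assumptions here, since this excerpt does not state the thresholds used in the paper. The function name `contact_map` and the toy coordinates are hypothetical.

```python
import math
from typing import List, Sequence


def contact_map(
    coords: Sequence[Sequence[float]],
    threshold: float = 8.0,   # assumed distance cutoff in angstroms
    min_separation: int = 6,  # assumed minimum sequence separation |i - j|
) -> List[List[int]]:
    """Return a symmetric 0/1 matrix; entry [i][j] is 1 if residues i and j are in contact."""
    n = len(coords)
    cmap = [[0] * n for _ in range(n)]
    for i in range(n):
        # Only consider pairs that are far apart in sequence (non-local pairs).
        for j in range(i + min_separation, n):
            if math.dist(coords[i], coords[j]) < threshold:
                cmap[i][j] = cmap[j][i] = 1
    return cmap


# Toy hairpin: residues 0-3 run along the x axis, residues 4-7 run back 5 angstroms above.
toy_coords = [(3.8 * i, 0.0, 0.0) for i in range(4)] + [
    (3.8 * (7 - i), 5.0, 0.0) for i in range(4, 8)
]
toy_map = contact_map(toy_coords)
for row in toy_map:
    print(row)
```

Residues 0 and 7 are seven positions apart in the toy sequence but only 5 Å apart in space, so they appear as a contact, while sequence neighbours such as residues 0 and 1 are excluded by the separation filter.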

Methods

Discussion

Conclusion

Journal: Membranes	Publication Date: Jun 30, 2021
Citations: 3	License type: CC BY 4.0

Similar Papers

CONCORD: a consensus method for protein secondary structure prediction via mixed integer linear optimization
Y Wei ... J Thompson
Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences | VOL. 468
18 Nov 2011

DeepECA: an end-to-end learning framework for protein contact prediction from a multiple sequence alignment
Hiroyuki Fukuda ... Kentaro Tomii
BMC Bioinformatics | VOL. 21
09 Jan 2020

RBO Aleph: leveraging novel information sources for protein structure prediction.
Mahmoud Mabrouk ... Tim Werner
Nucleic acids research | VOL. 43
20 Apr 2015

COMSAT: Residue contact prediction of transmembrane proteins based on support vector machines and mixed integer linear programming.
Huiling Zhang ... Christodoulos A Floudas
Proteins | VOL. 84
20 Jan 2016
