Deep template-based protein structure prediction.

Fandi Wu, Jinbo Xu

doi:10.1371/journal.pcbi.1008954

Abstract

Protein structure prediction has been greatly improved by deep learning, but most efforts have been devoted to template-free modeling, and very few deep learning methods have been developed for TBM (template-based modeling), a popular technique for protein structure prediction. TBM has been studied extensively in the past, but its accuracy is not satisfactory when highly similar templates are not available. This paper presents a new method, NDThreader (New Deep-learning Threader), to address the challenges of TBM. NDThreader first employs DRNF (deep convolutional residual neural fields), which is an integration of deep ResNet (convolutional residual neural networks) and CRF (conditional random fields), to align a query protein to templates without using any distance information. NDThreader then uses ADMM (alternating direction method of multipliers) and DRNF to further improve sequence-template alignments by making use of a predicted distance potential. Finally, NDThreader builds 3D models from a sequence-template alignment by feeding the alignment and sequence coevolution information into a deep ResNet to predict inter-atom distance distributions, which are then fed into PyRosetta for 3D model construction. Our experimental results show that NDThreader greatly outperforms existing methods such as CNFpred, HHpred, DeepThreader and CEthreader. NDThreader was blindly tested in CASP14 as part of the RaptorX server, which obtained the best average GDT score among all CASP14 servers on the 58 TBM targets.
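
The GDT score mentioned in the abstract measures how many residues of a model lie close to their positions in the experimental structure. The following is a minimal Python sketch of the commonly used GDT_TS variant, under a strong simplifying assumption: the two C-alpha traces are already superimposed, whereas the full GDT procedure also searches over many superpositions. The function name `gdt_ts` and the toy coordinates are illustrative and are not taken from the paper.

```python
import math


def gdt_ts(model_ca, native_ca, cutoffs=(1.0, 2.0, 4.0, 8.0)):
    """Simplified GDT_TS for two already-superimposed C-alpha traces.

    Assumption: coordinates are pre-aligned in space. The real GDT
    procedure additionally searches over many superpositions.
    """
    if len(model_ca) != len(native_ca) or not native_ca:
        raise ValueError("traces must be non-empty and of equal length")
    # Per-residue distance between model and native positions (in Angstrom).
    dists = [math.dist(m, n) for m, n in zip(model_ca, native_ca)]
    # Fraction of residues within each distance cutoff.
    fractions = [sum(d <= c for d in dists) / len(dists) for c in cutoffs]
    # Average over the cutoffs, reported on a 0-100 scale.
    return 100.0 * sum(fractions) / len(cutoffs)


# Toy example: four residues with deviations of 0, 1.5, 3 and 10 Angstrom.
native = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
model = [(0.0, 0.0, 0.0), (3.8, 1.5, 0.0), (7.6, 3.0, 0.0), (11.4, 10.0, 0.0)]
print(gdt_ts(model, native))  # 56.25
```

In this sketch a perfect model scores 100, and a residue that deviates by more than 8 Angstrom contributes nothing at any cutoff.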

Highlights

Predicting protein structure from its amino acid sequence is one of the most challenging problems in the field of computational biology.
NDThreader first employs DRNF, which is an integration of deep residual neural network (ResNet) and CRF, to align a query protein to templates without using any distance information.
NDThreader builds 3D models from a sequence-template alignment by feeding it and sequence coevolution information into a deep ResNet to predict inter-atom distance distribution, which is fed into PyRosetta for 3D model construction.

Summary

Introduction

Predicting protein structure from its amino acid sequence is one of the most challenging problems in the field of computational biology. Template-based modeling (TBM), including protein threading and homology modeling, is a popular method for protein tertiary structure prediction. TBM predicts the structure of a query protein (called the target) by aligning it to one or multiple templates with solved structures. Along with the growth of the PDB (Protein Data Bank), TBM is able to predict structures for a good percentage of proteins [1]. In CASP13, 67 out of 112 test domains had reasonable templates in the PDB; in CASP14, 58 out of 107 did. When a protein under prediction does not have highly similar templates, TBM faces three major challenges: selecting the best templates, building an accurate sequence-template alignment, and constructing 3D models from the alignment.
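
To make the alignment step concrete, here is a minimal Python sketch of global sequence-template alignment by dynamic programming. It is not NDThreader's DRNF, which scores alignments with a CRF whose potentials come from a deep ResNet. The per-pair `score` matrix, the linear `gap` penalty, and the function name `align` are hypothetical stand-ins chosen for illustration.

```python
def align(score, gap=-1.0):
    """Globally align a query (rows) to a template (columns).

    score[i][j] is a hypothetical compatibility score between query
    residue i and template residue j. Returns (total, pairs), where
    pairs lists the aligned (query_index, template_index) positions.
    """
    n, m = len(score), len(score[0])
    # best[i][j]: best score for the first i query and j template residues.
    best = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        best[i][0] = i * gap
    for j in range(1, m + 1):
        best[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            best[i][j] = max(
                best[i - 1][j - 1] + score[i - 1][j - 1],  # align i with j
                best[i - 1][j] + gap,  # leave query residue unaligned
                best[i][j - 1] + gap,  # leave template residue unaligned
            )
    # Trace back from the corner to recover the aligned residue pairs.
    pairs = []
    i, j = n, m
    while i > 0 and j > 0:
        if best[i][j] == best[i - 1][j - 1] + score[i - 1][j - 1]:
            pairs.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif best[i][j] == best[i - 1][j] + gap:
            i -= 1
        else:
            j -= 1
    return best[n][m], pairs[::-1]


# Toy scores: query residues 0, 1, 2 fit template residues 0, 2, 3 best.
toy_score = [
    [2, -1, -1, -1],
    [-1, -1, 2, -1],
    [-1, -1, -1, 2],
]
total, pairs = align(toy_score)
print(total, pairs)  # 5.0 [(0, 0), (1, 2), (2, 3)]
```

In the toy example the template residue at index 1 is skipped at the cost of one gap penalty. A threading method differs mainly in where the scores come from and in how gaps and alignment states are modeled.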

Methods

Results

Conclusion


Journal: PLoS computational biology	Publication Date: May 3, 2021
Citations: 25	License type: CC BY 4.0


