The protein structure prediction problem could be solved using the current PDB library.

Yang Zhang, Jeffrey Skolnick

doi:10.1073/pnas.0407152101

Abstract

For single-domain proteins, we examine the completeness of the structures in the current Protein Data Bank (PDB) library for use in full-length model construction of unknown sequences. To address this issue, we employ a comprehensive benchmark set of 1,489 medium-size proteins that cover the PDB at the level of 35% sequence identity and identify templates by structure alignment. With homologous proteins excluded, we can always find folds similar to native, with an average root-mean-square deviation (RMSD) from native of 2.5 Å and approximately 82% alignment coverage. These template structures often contain a significant number of insertions/deletions. The TASSER algorithm was applied to build full-length models, where continuous fragments are excised from the top-scoring templates and reassembled under the guidance of an optimized force field, which includes consensus restraints taken from the templates and knowledge-based statistical potentials. For almost all targets (all but 2 of 1,489), the resultant full-length models have an RMSD to native below 6 Å (97% of them below 4 Å). On average, the RMSD of full-length models is 2.25 Å, with aligned regions improved from 2.5 Å to 1.88 Å, comparable with the accuracy of low-resolution experimental structures. Furthermore, starting from state-of-the-art structural alignments, we demonstrate a methodology that can consistently bring template-based alignments closer to native. These results strongly suggest that the protein-folding problem can in principle be solved based on the current PDB library by developing efficient fold recognition algorithms that can recover such initial alignments.
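The two quantities the abstract reports for templates, RMSD over the aligned region and alignment coverage, can be illustrated with a minimal sketch. This is not the authors' code: the function names, the toy coordinates, and the representation of an alignment as (native index, template index) pairs are all assumptions made for illustration. The sketch also assumes the two structures are already superposed, whereas the RMSD values quoted above are measured after optimal superposition onto the native structure.

```python
import math


def rmsd(coords_a, coords_b):
    """RMSD between two equal-length lists of (x, y, z) points.

    Assumption: the two point sets are already superposed; no rotation
    or translation is fitted here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


def alignment_coverage(aligned_pairs, target_length):
    """Fraction of target residues that have an aligned template residue."""
    return len(aligned_pairs) / target_length


def aligned_rmsd(native, template, aligned_pairs):
    """RMSD restricted to the aligned residue pairs (hypothetical helper)."""
    native_pts = [native[i] for i, _ in aligned_pairs]
    template_pts = [template[j] for _, j in aligned_pairs]
    return rmsd(native_pts, template_pts)


# Toy data: a 5-residue "native" and a 4-residue "template" (one deletion).
native = [(0, 0, 0), (1, 0, 0), (2, 0, 0), (3, 0, 0), (4, 0, 0)]
template = [(0, 1, 0), (1, 1, 0), (2, 1, 0), (4, 1, 0)]
pairs = [(0, 0), (1, 1), (2, 2), (4, 3)]  # native residue 3 is unaligned

coverage = alignment_coverage(pairs, len(native))
deviation = aligned_rmsd(native, template, pairs)
print(f"coverage = {coverage:.2f}, aligned RMSD = {deviation:.2f}")
```

In this toy case, four of the five native residues are aligned and each aligned pair is offset by one unit, so the unaligned residue lowers coverage without contributing to the aligned RMSD. This is why the abstract reports both numbers together.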

Journal: Proceedings of the National Academy of Sciences of the United States of America
Publication Date: Jan 14, 2005
Citations: 283

Similar Papers

Segment assembly, structure alignment and iterative simulation in protein structure prediction
Yang Zhang ... Jeffrey Skolnick
BMC biology | VOL. 11
15 Apr 2013

The PDB is a Covering Set of Small Protein Structures
Daisuke Kihara ... Jeffrey Skolnick
Journal of molecular biology | VOL. 334
19 Nov 2003

Automated structure prediction of weakly homologous proteins on a genomic scale.
Yang Zhang ... Jeffrey Skolnick
Proceedings of the National Academy of Sciences of the United States of America | VOL. 101
04 May 2004

TM-align: a protein structure alignment algorithm based on the TM-score
Y Zhang
Nucleic acids research | VOL. 33
11 Apr 2005
