Systematic comparison of SCOP and CATH: a new gold standard for protein structure analysis

Gergely Csaba,Ralf Zimmer,Fabian Birzele

doi:10.1186/1472-6807-9-23

Abstract

BackgroundSCOP and CATH are widely used as gold standards to benchmark novel protein structure comparison methods as well as to train machine learning approaches for protein structure classification and prediction. The two hierarchies result from different protocols which may result in differing classifications of the same protein. Ignoring such differences leads to problems when being used to train or benchmark automatic structure classification methods. Here, we propose a method to compare SCOP and CATH in detail and discuss possible applications of this analysis.ResultsWe create a new mapping between SCOP and CATH and define a consistent benchmark set which is shown to largely reduce errors made by structure comparison methods such as TM-Align and has useful further applications, e.g. for machine learning methods being trained for protein structure classification. Additionally, we extract additional connections in the topology of the protein fold space from the orthogonal features contained in SCOP and CATH.ConclusionVia an all-to-all comparison, we find that there are large and unexpected differences between SCOP and CATH w.r.t. their domain definitions as well as their hierarchic partitioning of the fold space on every level of the two classifications. A consistent mapping of SCOP and CATH can be exploited for automated structure comparison and classification.AvailabilityBenchmark sets and an interactive SCOP-CATH browser are available at .

Highlights

SCOP and CATH are widely used as gold standards to benchmark novel protein structure comparison methods as well as to train machine learning approaches for protein structure classification and prediction
A consistent mapping of SCOP and CATH can be exploited for automated structure comparison and classification
Availability: Benchmark sets and an interactive SCOP-CATH browser are available at http:// www.bio.ifi.lmu.de/SCOPCath

Summary

Introduction

SCOP and CATH are widely used as gold standards to benchmark novel protein structure comparison methods as well as to train machine learning approaches for protein structure classification and prediction. The two hierarchies result from different protocols which may result in differing classifications of the same protein Ignoring such differences leads to problems when being used to train or benchmark automatic structure classification methods. The two most prominent protein structure classification schemes are SCOP [2] and CATH [3]. The SCOP database is mainly based on expert knowledge and, on the first level of the hierarchy, defines four major classes namely all α, all β, α/β as well as α + β describing the content of secondary structure elements in the domain.

Methods

Results

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Structural Biology	Publication Date: Apr 17, 2009
Citations: 94	License type: cc-by

R Discovery Prime

R Discovery Prime

Systematic comparison of SCOP and CATH: a new gold standard for protein structure analysis

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: BMC Structural Biology

Lead the way for us

Similar Papers

Chapter 4 - Protein Structure Alignment Using Evolutionary Computation
Joseph D Szustakowski ... Zhiping Weng
Evolutionary Computation in Bioinformatics | VOL. -
Joseph D Szustakowski, et. al.Joseph D Szustakowski ... Zhiping Weng
01 Jan 2003
Evolutionary Computation in Bioinformatics | VOL. -

Recognition of Structure Similarities in Proteins
Lin Wang ... Yuqing Qiu
Journal of Systems Science and Complexity | VOL. 21
Lin Wang, et. al.Lin Wang ... Yuqing Qiu
14 Nov 2008
Journal of Systems Science and Complexity | VOL. 21

Rapid protein structure classification using one-dimensional structure profiles on the bioSCAN parallel computer.
D.L Hoffman ... A Tropsha
Computer applications in the biosciences : CABIOS | VOL. 11
D.L Hoffman, et. al.D.L Hoffman ... A Tropsha
01 Jan 1995
Computer applications in the biosciences : CABIOS | VOL. 11

Comparison of protein structures using 3D profile alignment.
Mikita Suyama ... Yo Matsuo
Journal of molecular evolution | VOL. Suppl 44 1
Mikita Suyama, et. al.Mikita Suyama ... Yo Matsuo
01 Jan 1997
Journal of molecular evolution | VOL. Suppl 44 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Systematic comparison of SCOP and CATH: a new gold standard for protein structure analysis

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: BMC Structural Biology