Accurate Classification of Biological and non-Biological Interfaces in Protein Crystal Structures using Subtle Covariation Signals

Yoshinori Fukasawa, Kentaro Tomii

doi:10.1038/s41598-019-48913-8

Open Access

Abstract

Proteins often work as oligomers or multimers in vivo. Elucidating their oligomeric or multimeric form (quaternary structure) is therefore crucial for ascertaining their function. X-ray crystal structures of numerous proteins have accumulated, providing information related to their biological units, and extracting that information from protein crystal structures is a meaningful task for modern biology. Although many methods have been proposed for identifying biological units in protein crystal structures, it remains difficult to distinguish biological protein–protein interfaces from crystallographic ones. We therefore developed a simple but highly accurate classifier that infers biological units in protein crystal structures, using large amounts of protein sequence information and a modern contact prediction method to exploit covariation signals (CSs) in proteins. We demonstrate that the proposed method is promising even for weak signals of biological interfaces. We also discuss the relation between classification accuracy and conservation of biological units, and illustrate how the selection of sequences included in the multiple sequence alignments used to obtain CSs affects the results. As the amount of sequence data increases, the proposed method is expected to become increasingly useful.
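To make the idea of a covariation signal concrete, the following is a minimal sketch that scores how strongly two alignment columns vary together. It uses plain mutual information, which is a much simpler stand-in and not the contact prediction method used in the paper. The function names and the toy alignment are hypothetical.

```python
from collections import Counter
from math import log2


def mutual_information(msa, i, j):
    """Mutual information (in bits) between columns i and j of an MSA.

    `msa` is a list of equal-length aligned sequences. This is an
    illustrative covariation measure only, not the paper's method.
    """
    n = len(msa)
    col_i = [seq[i] for seq in msa]
    col_j = [seq[j] for seq in msa]
    freq_i = Counter(col_i)
    freq_j = Counter(col_j)
    freq_ij = Counter(zip(col_i, col_j))

    mi = 0.0
    for (a, b), count in freq_ij.items():
        p_ab = count / n
        p_a = freq_i[a] / n
        p_b = freq_j[b] / n
        mi += p_ab * log2(p_ab / (p_a * p_b))
    return mi


# Toy alignment (hypothetical): columns 0 and 1 change together,
# column 2 is fully conserved and therefore carries no covariation.
toy_msa = ["AKLE", "AKLE", "GRLD", "GRLD"]
covarying = mutual_information(toy_msa, 0, 1)
conserved = mutual_information(toy_msa, 0, 2)
```

In this toy case the covarying pair scores higher than the pair that includes a conserved column, which is the kind of contrast a covariation-based feature relies on.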

Highlights

In recent decades, various methods and analyses have been reported for identifying biological units in protein crystal structures
Protein Sparse InverseCOVariance (PSICOV)[24] filters out unpromising multiple sequence alignments (MSAs) using a diversity criterion: an MSA must contain at least as many non-redundant sequence clusters as the query length
We compared the numbers of sequences in MSAs that pass the PSICOV criterion, generated using the 2011 and 2016 databases
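The diversity criterion described in the highlights can be sketched as follows. This is a rough illustration under stated assumptions: the greedy clustering scheme and the 0.62 identity threshold are placeholders chosen for the example, and PSICOV's actual clustering procedure may differ. All function names are hypothetical.

```python
def identity(a, b):
    """Fraction of identical positions between two aligned sequences."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def count_clusters(msa, threshold=0.62):
    """Count non-redundant clusters by greedy clustering.

    A sequence starts a new cluster unless it matches an existing
    cluster representative at >= `threshold` identity.
    The threshold value is an assumption made for this sketch.
    """
    representatives = []
    for seq in msa:
        if not any(identity(seq, rep) >= threshold for rep in representatives):
            representatives.append(seq)
    return len(representatives)


def passes_diversity_criterion(msa, query_length, threshold=0.62):
    """True if the MSA has at least as many clusters as query positions."""
    return count_clusters(msa, threshold) >= query_length


# Toy alignments for a query of length 4 (hypothetical data).
diverse_msa = ["AKLE", "AKLD", "GRMD", "TSQW", "PVNH", "CCFY"]
shallow_msa = ["AKLE", "AKLD", "AKLE"]

diverse_ok = passes_diversity_criterion(diverse_msa, query_length=4)
shallow_ok = passes_diversity_criterion(shallow_msa, query_length=4)
```

Running the same check on alignments built from different database releases would show how many queries gain enough diversity as sequence data grows.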

Summary

Introduction

Various methods and analyses have been reported for identifying biological units in protein crystal structures. A few buried residues, comprising the so-called “core”, are more conserved than surrounding interface residues[16]. Work in this area was improved further with a new classifier, EPPIC, which uses the Shannon entropy ratio based only on fairly similar homologous sequences. EPPIC performed well with the new difficult dataset[17]. Although some parameters, such as contact size and geometric complementarity, work to some degree, no single parameter is always sufficient for interface classification. We applied contact prediction to the interface classification problem in protein crystals, where actual contacts are already given, and demonstrated that features based on CSs are promising for this field of study. This study elucidates differences between intrachain contact prediction and contact prediction at the interaction interface.
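The conservation-based idea mentioned above (core residues being more conserved than the surrounding interface residues) can be illustrated with a Shannon entropy ratio. This is a minimal sketch and not EPPIC's implementation: the choice of core and surrounding columns, the full 20-letter alphabet, and the function names are all assumptions made for the example.

```python
from collections import Counter
from math import log2


def column_entropy(msa, i):
    """Shannon entropy (bits) of the residues in column i of an MSA."""
    n = len(msa)
    counts = Counter(seq[i] for seq in msa)
    return -sum((c / n) * log2(c / n) for c in counts.values())


def entropy_ratio(msa, core_cols, rim_cols):
    """Mean entropy of core columns divided by mean entropy of rim columns.

    Values below 1 indicate that the assumed core positions are more
    conserved than the assumed surrounding (rim) positions.
    """
    core = sum(column_entropy(msa, i) for i in core_cols) / len(core_cols)
    rim = sum(column_entropy(msa, i) for i in rim_cols) / len(rim_cols)
    if rim == 0:
        raise ValueError("rim columns are fully conserved; ratio undefined")
    return core / rim


# Toy alignment (hypothetical): columns 0-1 are treated as core,
# columns 2-3 as surrounding interface positions.
toy_msa = ["AKLE", "AKME", "ARQD", "ARTD"]
ratio = entropy_ratio(toy_msa, core_cols=[0, 1], rim_cols=[2, 3])
```

A ratio well below 1 would point toward a conserved core, which is the kind of evidence conservation-based classifiers use; the covariation-based features studied in the paper are a different signal derived from the same alignments.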

Methods

Results

Conclusion

Journal: Scientific Reports
Publication Date: Aug 30, 2019
Citations: 6
License type: open-access
