Abstract
MotivationTo understand protein structure, folding and function fully and to design proteins de novo reliably, we must learn from natural protein structures that have been characterized experimentally. The number of protein structures available is large and growing exponentially, which makes this task challenging. Indeed, computational resources are becoming increasingly important for classifying and analyzing this resource. Here, we use tools from graph theory to define an Atlas classification scheme for automatically categorizing certain protein substructures.ResultsFocusing on the α-helical coiled coils, which are ubiquitous protein-structure and protein–protein interaction motifs, we present a suite of computational resources designed for analyzing these assemblies. iSOCKET enables interactive analysis of side-chain packing within proteins to identify coiled coils automatically and with considerable user control. Applying a graph theory-based Atlas classification scheme to structures identified by iSOCKET gives the Atlas of Coiled Coils, a fully automated, updated overview of extant coiled coils. The utility of this approach is illustrated with the first formal classification of an emerging subclass of coiled coils called α-helical barrels. Furthermore, in the Atlas, the known coiled-coil universe is presented alongside a partial enumeration of the ‘dark matter’ of coiled-coil structures; i.e. those coiled-coil architectures that are theoretically possible but have not been observed to date, and thus present defined targets for protein design.Availability and implementationiSOCKET is available as part of the open-source GitHub repository associated with this work (https://github.com/woolfson-group/isocket). This repository also contains all the data generated when classifying the protein graphs. The Atlas of Coiled Coils is available at: http://coiledcoils.chm.bris.ac.uk/atlas/app.
Highlights
With more than 130 000 structures currently available in the Protein Data Bank (PDB) (Berman, 2000), the need for protein-structure classification is clear (Andreeva et al, 2014; Sillitoe et al, 2015)
To represent coiled coils as graphs, we have developed a Python-based implementation of the program SOCKET (Walshaw and Woolfson, 2001) for identifying KIH interactions, and coiled coils, within protein structures
ISOCKET follows a similar procedure for identifying knobs-into-holes (KIH) interactions to that described fully in the original SOCKET paper (Walshaw and Woolfson, 2001). This proceeds as follows: Given a protein structure, the a helices are extracted and the centroid of the side-chain is stored for each residue
Summary
With more than 130 000 structures currently available in the Protein Data Bank (PDB) (Berman, 2000), the need for protein-structure classification is clear (Andreeva et al, 2014; Sillitoe et al, 2015). Such classifications demonstrate the structural diversity exhibited by proteins in nature; develop our understanding of proteins; and facilitate comparisons between structures. Protein-structure classifications provide inspiration for protein designers to identify the structures that are not yet present in these schemes and construct them de novo (Kuhlman et al, 2003; Michalopoulos et al, 2004; Thomson et al, 2014; Zaccai et al, 2011).
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.