Abstract

The analysis of large structural databases reveals general features and relationships among proteins, providing useful insight. A different approach is required to characterize ubiquitous secondary-structure elements, where flexibility is essential in order to capture small local differences. The ALEPH software is optimized for the analysis and the extraction of small protein folds by relying on their geometry rather than on their sequence. The annotation of the structural variability of a given fold provides valuable information for fragment-based molecular-replacement methods, in which testing alternative model hypotheses can succeed in solving difficult structures when no homology models are available or are successful. ARCIMBOLDO_BORGES combines the use of composite secondary-structure elements as a search model with density modification and tracing to reveal the rest of the structure when both steps are successful. This phasing method relies on general fold libraries describing variations around a given pattern of β-sheets and helices extracted using ALEPH. The program introduces characteristic vectors defined from the main-chain atoms as a way to describe the geometrical properties of the structure. ALEPH encodes structural properties in a graph network, the exploration of which allows secondary-structure annotation, decomposition of a structure into small compact folds, generation of libraries of models representing a variation ofa given fold and finally superposition of these folds onto a target structure. These functions are available through a graphical interface designed to interactively show the results of structure manipulation, annotation, fold decomposition, clustering and library generation. ALEPH can produce pictures of the graphs, structures and folds for publication purposes.

Highlights

  • Secondary-structure properties are usually derived from the hydrogen-bond pattern

  • The ALEPH software is optimized for the analysis and the extraction of small protein folds by relying on their geometry rather than on their sequence

  • ALEPH encodes structural properties in a graph network, the exploration of which allows secondary-structure annotation, decomposition of a structure into small compact folds, generation of libraries of models representing a variation of a given fold and superposition of these folds onto a target structure

Read more

Summary

Introduction

Secondary-structure properties are usually derived from the hydrogen-bond pattern. They were predicted even before the structures of full proteins had been determined (Pauling et al, 1951; Pauling & Corey, 1951). The advantage of CVs is that the same reduction in dimensionality can be applied within different scopes: for example, the environment of single amino acids, when CVs are calculated over overlapping tripeptides, or to secondary-structure units in a fold, when CVs are defined over such longer stretches This formalism is useful for the geometric description of the small fragments used for phasing in the ARCIMBOLDO programs (Millan et al, 2015). We present the CV-based program ALEPH, which was developed as a bioinformatics tool to handle fragments and prepare libraries representing variations of a given fold for MR Extraction of such libraries is performed without relying on sequences or alignments to allow searches across different families

Software versions
Computing setup
ALEPH as a composite bioinformatics tool
Secondary- and tertiary-structure annotation
Decomposition through community clustering
Library generation
Superposition
Distributed libraries
Conclusion
Distance score between matrices in extraction
Findings
Funding information
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call