Abstract

We present a statistical analysis of protein structures based on interatomic C alpha distances. The overall distance distributions reflect in detail the contents of sequence-specific substructures maintained by local interactions (such as alpha-helixes) and longer range interactions (such as disulfide bridges and beta-sheets). We also show that a volume scaling of the distances makes distance distributions for protein chains of different length superimposable. Distance distributions were also calculated specifically for amino acids separated by a given number of residues. Specific features in these distributions are visible for sequence separations of up to 20 amino acid residues. A simple representation, which preserves most of the information in the distance distributions, was obtained using six parameters only. The parameters give rise to canonical distance intervals and when predicting coarse-grained distance constraints by methods such as data-driven artificial neural networks, these should preferably be selected from these intervals. We discuss the use of the six parameters for determining or reconstructing 3-D protein structures.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call