Abstract

BackgroundElucidating the native structure of a protein molecule from its sequence of amino acids, a problem known as de novo structure prediction, is a long standing challenge in computational structural biology. Difficulties in silico arise due to the high dimensionality of the protein conformational space and the ruggedness of the associated energy surface. The issue of multiple minima is a particularly troublesome hallmark of energy surfaces probed with current energy functions. In contrast to the true energy surface, these surfaces are weakly-funneled and rich in comparably deep minima populated by non-native structures. For this reason, many algorithms seek to be inclusive and obtain a broad view of the low-energy regions through an ensemble of low-energy (decoy) conformations. Conformational diversity in this ensemble is key to increasing the likelihood that the native structure has been captured.MethodsWe propose an evolutionary search approach to address the multiple-minima problem in decoy sampling for de novo structure prediction. Two population-based evolutionary search algorithms are presented that follow the basic approach of treating conformations as individuals in an evolving population. Coarse graining and molecular fragment replacement are used to efficiently obtain protein-like child conformations from parents. Potential energy is used both to bias parent selection and determine which subset of parents and children will be retained in the evolving population. The effect on the decoy ensemble of sampling minima directly is measured by additionally mapping a conformation to its nearest local minimum before considering it for retainment. The resulting memetic algorithm thus evolves not just a population of conformations but a population of local minima.Results and conclusionsResults show that both algorithms are effective in terms of sampling conformations in proximity of the known native structure. The additional minimization is shown to be key to enhancing sampling capability and obtaining a diverse ensemble of decoy conformations, circumventing premature convergence to sub-optimal regions in the conformational space, and approaching the native structure with proximity that is comparable to state-of-the-art decoy sampling methods. The results are shown to be robust and valid when using two representative state-of-the-art coarse-grained energy functions.

Highlights

  • Elucidating the native structure of a protein molecule from its sequence of amino acids, a problem known as de novo structure prediction, is a long standing challenge in computational structural biology

  • The results presented below focus on comparing the sampling ability between the two algorithms with respect to obtained lowest energy and proximity to the native structure

  • The results obtained by the Memetic EA (MEA) using the Associative Memory Hamiltonian with Water (AMW) energy function are compared with those obtained using the Rosetta score3 energy function

Read more

Summary

Introduction

Elucidating the native structure of a protein molecule from its sequence of amino acids, a problem known as de novo structure prediction, is a long standing challenge in computational structural biology. Elucidating a representative three-dimensional structure of the protein native state, referred to as the native structure, is essential in structure-driven studies on engineering novel proteins, predicting protein stability, modeling interactions upon protein complexation, and designing effective drugs [1,2,3,4,5,6,7,8,9] While experimental techniques, such as X-ray crystallography, nuclear magnetic resonance, and cryo-electron microscopy, are able to extract atomic coordinates of the native structure, these techniques are laborious, expensive, and have limitations either in the subset of proteins on which they can be applied or in the resolution and quality of their findings. Computational approaches are necessary to model the protein native state

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call