Abstract

We exploit the transferability of quantum topological atoms in the construction of a multipolar polarizable protein force field QCTFF. A helical oligopeptide of 10 alanine residues (103 atoms) has its total electrostatic energy predicted using the kriging machine learning method with a mean error of 6.4 kJ mol−1. This error is similar to that found in smaller molecules presented in past QCTFF publications. Kriging relates the molecular geometry to atomic multipole moments that describe the ab initio electron density. Atom types are constructed from similar atoms within the helix. As the atoms within a given atom type share a local chemical environment, they can share a kriging model with a reduced number of input descriptors (i.e. features). The feature reduction decreases the kriging training times by more than 23 times but increases the prediction error by only 1.3 %. In transferability tests, transferable models give a 5.7 % error when predicting moments of an atom outside the training set, compared to the 3.9 % error when tested against data belonging to atoms included in the training data. The transferable kriging models successfully predict atomic multipole moments with useful accuracy, opening an avenue to QCTFF modelling of a whole protein.

Highlights

  • The simulation and analysis of large biomolecules is a task best handled by force fields, a state of affairs that will most likely remain so for the foreseeable future due to large computational cost of first-principle methods

  • In using machine learning models to describe an atom’s properties, we introduce a departure from how atom types are approached in typical force fields, with respect to their parameterisation

  • The REGULAR data sets represent the current paradigm for QCTFF as well as the best model quality currently achievable

Read more

Summary

Introduction

The simulation and analysis of large biomolecules is a task best handled by force fields, a state of affairs that will most likely remain so for the foreseeable future due to large computational cost of first-principle methods. We have reported on a different and new approach that predicts highrank multipolar and fully polarised electrostatic interaction energies in water clusters [1], ethanol [2], alanine [3], serine [4], N-methylacetamide and histidine [5], aromatic amino acids [6], hydrogen-bonded dimers [7], and atomic kinetic energies of methanol, N-methylacetamide, glycine, and triglycine [8] These studies feature a developing force field, QCTFF, which predicts electrostatic multipole moments using machine learning, fully taking into account polarisation.

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.