Abstract

The study of the thermodynamics, kinetics, and microscopic mechanisms of chemical reactions in solution requires the use of advanced free-energy methods for predictions to be quantitative. This task is however a formidable one for atomistic simulation methods, as the cost of quantum-based ab initio approaches, to obtain statistically meaningful samplings of the relevant chemical spaces and networks, becomes exceedingly heavy. In this work, we critically assess the optimal structure and minimal size of an ab initio training set able to lead to accurate free-energy profiles sampled with neural network potentials. The results allow one to propose an ab initio protocol where the ad hoc inclusion of a machine-learning (ML)-based task can significantly increase the computational efficiency, while keeping the ab initio accuracy and, at the same time, avoiding some of the notorious extrapolation risks in typical atomistic ML approaches. We focus on two representative, and computationally challenging, reaction steps of the classic Strecker-cyanohydrin mechanism for glycine synthesis in water solution, where the main precursors are formaldehyde and hydrogen cyanide. We demonstrate that indistinguishable ab initio quality results are obtained, thanks to the ML subprotocol, at about 1 order of magnitude less of computational load.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.