From Potentials to Properties: Data-Driven Many-Body Simulations of Water and Aqueous Systems.

  • Abstract
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon

Water is one of the most extensively studied molecular systems, yet its behavior across different phases, interfaces, and chemical environments continues to challenge existing models. Over the past decade, the development of data-driven many-body potential energy functions (PEFs) within the many-body energy (MB-nrg) formalism has enabled simulations of water and aqueous systems with unprecedented predictive power. Rooted in the many-body expansion and rigorously derived from "gold standard" electronic structure data, these PEFs bridge quantum chemistry and statistical mechanics within a unified framework. In this article, we review how the MB-pol PEF for water and the MB-nrg PEFs for hydrated halide and alkali metal ions have reshaped our understanding of aqueous properties across the gas, liquid, and solid phases, offering detailed insights into hydrogen bonding, spectroscopy, isotope effects, and phase stability. By connecting length scales and timescales while maintaining quantum-mechanical accuracy, the data-driven many-body MB-nrg formalism provides a robust foundation for realistic molecular simulations and offers new opportunities for addressing long-standing questions in physical chemistry and beyond.

Similar Papers
  • Research Article
  • Cite Count Icon 49
  • 10.1063/5.0156036
MBX: A many-body energy and force calculator for data-driven many-body simulations.
  • Aug 1, 2023
  • The Journal of chemical physics
  • Marc Riera + 7 more

Many-Body eXpansion (MBX) is a C++ library that implements many-body potential energy functions (PEFs) within the "many-body energy" (MB-nrg) formalism. MB-nrg PEFs integrate an underlying polarizable model with explicit machine-learned representations of many-body interactions to achieve chemical accuracy from the gas to the condensed phases. MBX can be employed either as a stand-alone package or as an energy/force engine that can be integrated with generic software for molecular dynamics and Monte Carlo simulations. MBX is parallelized internally using Open Multi-Processing and can utilize Message Passing Interface when available in interfaced molecular simulation software. MBX enables classical and quantum molecular simulations with MB-nrg PEFs, as well as hybrid simulations that combine conventional force fields and MB-nrg PEFs, for diverse systems ranging from small gas-phase clusters to aqueous solutions and molecular fluids to biomolecular systems and metal-organic frameworks.

  • Research Article
  • Cite Count Icon 34
  • 10.1063/5.0063198
MB-Fit: Software infrastructure for data-driven many-body potential energy functions.
  • Sep 23, 2021
  • The Journal of Chemical Physics
  • Ethan F Bull-Vulpe + 3 more

Many-body potential energy functions (MB-PEFs), which integrate data-driven representations of many-body short-range quantum mechanical interactions with physics-based representations of many-body polarization and long-range interactions, have recently been shown to provide high accuracy in the description of molecular interactions from the gas to the condensed phase. Here, we present MB-Fit, a software infrastructure for the automated development of MB-PEFs for generic molecules within the TTM-nrg (Thole-type model energy) and MB-nrg (many-body energy) theoretical frameworks. Besides providing all the necessary computational tools for generating TTM-nrg and MB-nrg PEFs, MB-Fit provides a seamless interface with the MBX software, a many-body energy and force calculator for computer simulations. Given the demonstrated accuracy of the MB-PEFs, particularly within the MB-nrg framework, we believe that MB-Fit will enable routine predictive computer simulations of generic (small) molecules in the gas, liquid, and solid phases, including, but not limited to, the modeling of quantum isomeric equilibria in molecular clusters, solvation processes, molecular crystals, and phase diagrams.

  • Research Article
  • Cite Count Icon 82
  • 10.1063/1.5005540
Vibrational spectra of halide-water dimers: Insights on ion hydration from full-dimensional quantum calculations on many-body potential energy surfaces
  • Nov 22, 2017
  • The Journal of Chemical Physics
  • Pushp Bajaj + 3 more

Full-dimensional vibrational spectra are calculated for both X-(H2O) and X-(D2O) dimers (X = F, Cl, Br, I) at the quantum-mechanical level. The calculations are carried out on two sets of recently developed potential energy functions (PEFs), namely, Thole-type model energy (TTM-nrg) and many-body energy (MB-nrg), using the symmetry-adapted Lanczos algorithm with a product basis set including all six vibrational coordinates. Although both TTM-nrg and MB-nrg PEFs are derived from coupled-cluster single double triple-F12 data obtained in the complete basis set limit, they differ in how many-body effects are represented at short range. Specifically, while both models describe long-range interactions through the combination of two-body dispersion and many-body classical electrostatics, the relatively simple Born-Mayer functions employed in the TTM-nrg PEFs to represent short-range interactions are replaced in the MB-nrg PEFs by permutationally invariant polynomials to achieve chemical accuracy. For all dimers, the MB-nrg vibrational spectra are in close agreement with the available experimental data, correctly reproducing anharmonic and nuclear quantum effects. In contrast, the vibrational frequencies calculated with the TTM-nrg PEFs exhibit significant deviations from the experimental values. The comparison between the TTM-nrg and MB-nrg results thus reinforces the notion that an accurate representation of both short-range interactions associated with electron density overlap and long-range many-body electrostatic interactions is necessary for a correct description of hydration phenomena at the molecular level.

  • Research Article
  • Cite Count Icon 3
  • 10.1007/s12648-019-01436-4
Higher-term contributions in the many-body calculation of the compressibility and thermodynamic properties of solid neon
  • Apr 1, 2019
  • Indian Journal of Physics
  • X R Zheng

To investigate both compressibility and thermodynamic properties of solid face-centered cubic neon, the many-body potential energy, which is expanded as a sum of two- to five-body potentials, was calculated. The calculation used the ab initio Hartree–Fock self-consistent field method in combination with the many-body expansion method. The results indicate that the many-body expansion potential is an exchange convergent series, and the even many-body potential contributions to the cohesive energy are repulsive. The odd many-body potential contributions to the cohesive energy, on the other hand, are attractive. The absolute values of the many-body potential energy Un obey |Un| > |Un+1|. Both the many-body potential energy and the total potential energy tend to saturate with the increase in atomic numbers and neighboring shell numbers. When the atomic distance R exceeds 2.60 A, the interaction energy may be described by two-body interactions. For atomic distances between 1.80 and 2.60 A, a three-body contribution to the many-body expansion potential is required, while for distances between 1.60 and 1.80 A, four-body contributions need to be considered. The calculated isotherm is in good agreement with the obtained experimental results in the studied pressure range (0–237 GPa), which considers the four-body potential if the pressure reaches 240 GPa. Below 1.60 A, we have to consider the five-body potential to accurately match the experimental data (for the pressure up to 280 GPa). Overall, the inclusion of the high many-body interaction energy makes it possible to obtain the most accurate equation of the state for solid neon under ambient conditions and higher pressure.

  • Preprint Article
  • 10.26434/chemrxiv-2021-pjr3l-v2
MB-Fit: Software Infrastructure for Data-Driven Many-Body Potential Energy Functions
  • Jul 16, 2021
  • Ethan Bull-Vulpe + 3 more

Many-body potential energy functions (MB-PEFs), which integrate data-driven representations of many-body short-range quantum mechanical interactions with physics-based representations of many-body polarization and long-range interactions, have recently been shown to provide high accuracy in the description of molecular interactions, from the gas to the condensed phase. Here, we present MB-Fit, a software infrastructure for the auto- mated development of MB-PEFs for generic molecules within the TTM-nrg (“Thole-type model energy”) and MB-nrg (“many-body energy”) theoretical frameworks. Besides providing all the necessary computational tools for generating TTM-nrg and MB-nrg PEFs, MB-Fit provides a seamless interface with the MBX software, a many-body energy/force calculator for computer simulations. Given the demonstrated accuracy of the MB-PEFs, we believe that MB-Fit will enable routine, predictive computer simulations of generic (small) molecules in the gas, liquid, and solid phases, including, but not limited to, the modeling of isomeric equilibria in molecular clusters, solvation processes, molecular crystals, and phase diagrams.

  • Research Article
  • Cite Count Icon 37
  • 10.1021/acs.jctc.2c00645
Data-Driven Many-Body Potential Energy Functions for Generic Molecules: Linear Alkanes as a Proof-of-Concept Application.
  • Sep 16, 2022
  • Journal of Chemical Theory and Computation
  • Ethan F Bull-Vulpe + 3 more

We present a generalization of the many-body energy (MB-nrg) theoretical/computational framework that enables the development of data-driven potential energy functions (PEFs) for generic covalently bonded molecules, with arbitrary quantum mechanical accuracy. The "nearsightedness of electronic matter" is exploited to define monomers as "natural building blocks" on the basis of their distinct chemical identity. The energy of generic molecules is then expressed as a sum of individual many-body energies of incrementally larger subsystems. The MB-nrg PEFs represent the low-order n-body energies, with n = 1-4, using permutationally invariant polynomials derived from electronic structure data carried out at an arbitrary quantum mechanical level of theory, while all higher-order n-body terms (n > 4) are represented by a classical many-body polarization term. As a proof-of-concept application of the general MB-nrg framework, we present MB-nrg PEFs for linear alkanes. The MB-nrg PEFs are shown to accurately reproduce reference energies, harmonic frequencies, and potential energy scans of alkanes, independently of their length. Since, by construction, the MB-nrg framework introduced here can be applied to generic covalently bonded molecules, we envision future computer simulations of complex molecular systems using data-driven MB-nrg PEFs, with arbitrary quantum mechanical accuracy.

  • Research Article
  • Cite Count Icon 127
  • 10.1063/1.4993213
Toward chemical accuracy in the description of ion-water interactions through many-body representations. Alkali-water dimer potential energy surfaces.
  • Jul 24, 2017
  • The Journal of Chemical Physics
  • Marc Riera + 4 more

This study presents the extension of the MB-nrg (Many-Body energy) theoretical/computational framework of transferable potential energy functions (PEFs) for molecular simulations of alkali metal ion-water systems. The MB-nrg PEFs are built upon the many-body expansion of the total energy and include the explicit treatment of one-body, two-body, and three-body interactions, with all higher-order contributions described by classical induction. This study focuses on the MB-nrg two-body terms describing the full-dimensional potential energy surfaces of the M+(H2O) dimers, where M+ = Li+, Na+, K+, Rb+, and Cs+. The MB-nrg PEFs are derived entirely from "first principles" calculations carried out at the explicitly correlated coupled-cluster level including single, double, and perturbative triple excitations [CCSD(T)-F12b] for Li+ and Na+ and at the CCSD(T) level for K+, Rb+, and Cs+. The accuracy of the MB-nrg PEFs is systematically assessed through an extensive analysis of interaction energies, structures, and harmonic frequencies for all five M+(H2O) dimers. In all cases, the MB-nrg PEFs are shown to be superior to both polarizable force fields and ab initio models based on density functional theory. As previously demonstrated for halide-water dimers, the MB-nrg PEFs achieve higher accuracy by correctly describing short-range quantum-mechanical effects associated with electron density overlap as well as long-range electrostatic many-body interactions.

  • Research Article
  • Cite Count Icon 330
  • 10.1103/physrev.95.217
Two-Body Forces and Nuclear Saturation. I. Central Forces
  • Jul 1, 1954
  • Physical Review
  • K A Brueckner + 2 more

The problem of nuclear saturation for the rapidly varying and nonmonotonic potentials of pseudoscalar meson theory has been investigated. In these potentials, variational methods using independent-particle trial functions are grossly inadequate. Although the problem can be approached using more general variational functions with interparticle correlation, the evaluation of the resulting expressions is very difficult since indirect correlations involving many more than two particles become important at high densities. An alternative procedure has been developed which allows a rather straightforward evaluation of the many-body energy even when the potentials are of great complexity. This method depends on a treatment of the coherent particle motion which is exact in the limit of very many scatterers, and treats the incoherent motion as a perturbation. In this case the many-body potential energy can be simply expressed in terms of the low-energy scattering amplitudes. This method has been applied to the two-body potentials given by pseudoscalar meson theory when the effects of nucleon pair formation are assumed to be small. In this approximation the many-body forces of the theory are negligible. These potentials given an excellent fit to the low-energy scattering parameters of the two-nucleon system and also an approximately correct description of scattering up to 90 Mev. They are characterized by repulsive cores of radii 0.3-0.4\ensuremath{\Elzxh}/\ensuremath{\mu}c and quite weak interactions in odd states. The many-body energy has been evaluated neglecting the tensor contributions which average to zero in first approximation. The result shows saturation at an energy per particle (neglecting Coulomb energy) of 12 Mev at a nuclear radius of $1.15\ifmmode\times\else\texttimes\fi{}{10}^{\ensuremath{-}13}{A}^{\frac{1}{3}}$ cm. The method has also been applied to potentials of the L\'evy type in which the odd-state potentials are rather strong and attractive. To give saturation with these near normal density, a 3-body force of the type given by the pair terms in the pseudoscalar coupling with a coupling constant $\frac{{g}^{2}}{4\ensuremath{\pi}}\ensuremath{\sim}3$ is required. Finally, the method can easily be extended to the determination of the elastic interaction of a slow neutron with a nucleus. The resulting "Weisskopf" potential has a depth of 35 Mev.

  • Research Article
  • Cite Count Icon 16
  • 10.1021/ct400488x
Reactive Many-Body Expansion for a Protonated Water Cluster.
  • Dec 30, 2013
  • Journal of Chemical Theory and Computation
  • Peter Pinski + 1 more

We generalize the standard many-body expansion technique that is used to approximate the total energy of a molecular system to enable the treatment of chemical reactions by quantum chemical techniques. By considering all possible assignments of atoms to monomer units of the many-body expansion and associating suitable weights with each, we construct a potential energy surface that is a smooth function of the nuclear positions. We derive expressions for this reactive many-body expansion energy and describe an algorithm for its evaluation, which scales polynomially with system size, and therefore will make the method feasible for future condensed phase simulations. We demonstrate the accuracy and smoothness of the resulting potential energy surface on a molecular dynamics trajectory of the protonated water hexamer, using the Hartree-Fock method for the many-body term and Møller-Plesset theory for the low order terms of the many-body expansion.

  • Research Article
  • Cite Count Icon 33
  • 10.1016/j.joule.2017.10.011
Electrochemical Energy Storage with Mediator-Ion Solid Electrolytes
  • Nov 1, 2017
  • Joule
  • Xingwen Yu + 1 more

Electrochemical Energy Storage with Mediator-Ion Solid Electrolytes

  • Research Article
  • Cite Count Icon 25
  • 10.1016/j.molliq.2009.02.009
Viscosities of ammonium salts in water and ethanol + water systems at different temperatures
  • Mar 6, 2009
  • Journal of Molecular Liquids
  • Rehana Saeed + 3 more

Viscosities of ammonium salts in water and ethanol + water systems at different temperatures

  • Research Article
  • Cite Count Icon 206
  • 10.1063/1.5024577
Comparison of permutationally invariant polynomials, neural networks, and Gaussian approximation potentials in representing water interactions through many-body expansions.
  • Apr 9, 2018
  • The Journal of Chemical Physics
  • Thuong T Nguyen + 7 more

The accurate representation of multidimensional potential energy surfaces is a necessary requirement for realistic computer simulations of molecular systems. The continued increase in computer power accompanied by advances in correlated electronic structure methods nowadays enables routine calculations of accurate interaction energies for small systems, which can then be used as references for the development of analytical potential energy functions (PEFs) rigorously derived from many-body (MB) expansions. Building on the accuracy of the MB-pol many-body PEF, we investigate here the performance of permutationally invariant polynomials (PIPs), neural networks, and Gaussian approximation potentials (GAPs) in representing water two-body and three-body interaction energies, denoting the resulting potentials PIP-MB-pol, Behler-Parrinello neural network-MB-pol, and GAP-MB-pol, respectively. Our analysis shows that all three analytical representations exhibit similar levels of accuracy in reproducing both two-body and three-body reference data as well as interaction energies of small water clusters obtained from calculations carried out at the coupled cluster level of theory, the current gold standard for chemical accuracy. These results demonstrate the synergy between interatomic potentials formulated in terms of a many-body expansion, such as MB-pol, that are physically sound and transferable, and machine-learning techniques that provide a flexible framework to approximate the short-range interaction energy terms.

  • Research Article
  • Cite Count Icon 35
  • 10.1021/ar500068a
Quantum mechanical fragment methods based on partitioning atoms or partitioning coordinates.
  • May 19, 2014
  • Accounts of Chemical Research
  • Bo Wang + 5 more

Conspectus The development of more efficient and more accurate ways to represent reactive potential energy surfaces is a requirement for extending the simulation of large systems to more complex systems, longer-time dynamical processes, and more complete statistical mechanical sampling. One way to treat large systems is by direct dynamics fragment methods. Another way is by fitting system-specific analytic potential energy functions with methods adapted to large systems. Here we consider both approaches. First we consider three fragment methods that allow a given monomer to appear in more than one fragment. The first two approaches are the electrostatically embedded many-body (EE-MB) expansion and the electrostatically embedded many-body expansion of the correlation energy (EE-MB-CE), which we have shown to yield quite accurate results even when one restricts the calculations to include only electrostatically embedded dimers. The third fragment method is the electrostatically embedded molecular tailoring approach (EE-MTA), which is more flexible than EE-MB and EE-MB-CE. We show that electrostatic embedding greatly improves the accuracy of these approaches compared with the original unembedded approaches. Quantum mechanical fragment methods share with combined quantum mechanical/molecular mechanical (QM/MM) methods the need to treat a quantum mechanical fragment in the presence of the rest of the system, which is especially challenging for those parts of the rest of the system that are close to the boundary of the quantum mechanical fragment. This is a delicate matter even for fragments that are not covalently bonded to the rest of the system, but it becomes even more difficult when the boundary of the quantum mechanical fragment cuts a bond. We have developed a suite of methods for more realistically treating interactions across such boundaries. These methods include redistributing and balancing the external partial atomic charges and the use of tuned fluorine atoms for capping dangling bonds, and we have shown that they can greatly improve the accuracy. Finally we present a new approach that goes beyond QM/MM by combining the convenience of molecular mechanics with the accuracy of fitting a potential function to electronic structure calculations on a specific system. To make the latter practical for systems with a large number of degrees of freedom, we developed a method to interpolate between local internal-coordinate fits to the potential energy. A key issue for the application to large systems is that rather than assigning the atoms or monomers to fragments, we assign the internal coordinates to reaction, secondary, and tertiary sets. Thus, we make a partition in coordinate space rather than atom space. Fits to the local dependence of the potential energy on tertiary coordinates are arrayed along a preselected reaction coordinate at a sequence of geometries called anchor points; the potential energy function is called an anchor points reactive potential. Electrostatically embedded fragment methods and the anchor points reactive potential, because they are based on treating an entire system by quantum mechanical electronic structure methods but are affordable for large and complex systems, have the potential to open new areas for accurate simulations where combined QM/MM methods are inadequate.

  • Research Article
  • Cite Count Icon 5
  • 10.1007/s00894-010-0675-y
Many-body energies during proton transfer in an aqueous system
  • Feb 27, 2010
  • Journal of Molecular Modeling
  • Ajay Chaudhari + 2 more

The energetics of the mechanism of proton transfer from a hydronium ion to one of the water molecules in its first solvation shell are studied using density functional theory and the Møller-Plesset perturbation (MP2) method. The potential energy surface of the proton transfer mechanism is obtained at the B3LYP and MP2 levels with the 6-311++G** basis set. Many-body analysis is applied to the proton transfer mechanism to obtain the change in relaxation energy, two-body, three-body and four-body energies when proton transfer occurs from the hydronium ion to one of the water molecules in its first solvation shell. It is observed that the binding energy (BE) of the complex decreases during the proton transfer process at both levels of theory. During the proton transfer process, the % contribution of the total two-body energy to the binding energy of the complex increases from 62.9 to 68.09% (39.9 to 45.95%), and that of the total three-body increases from 25.9 to 27.09% (24.16 to 26.17%) at the B3LYP/6-311++G** (MP2/ 6-311++G**) level. There is almost no change in the water-water-water three-body interaction energy during the proton transfer process at both levels of theory. The contribution of the relaxation energy and the total four-body energy to the binding energy of the complex is greater at the MP2 level than at the B3LYP level. Significant differences are found between the relaxation energies, the hydronium-water interaction energies and the four-body interaction energies at the B3LYP and MP2 levels.

  • Preprint Article
  • Cite Count Icon 24
  • 10.26434/chemrxiv-2021-hstgf-v3
Elevating Density Functional Theory to Chemical Accuracy for Water Simulations through a Density-Corrected Many-Body Formalism
  • Oct 5, 2021
  • Saswata Dasgupta + 3 more

Density functional theory (DFT) has been extensively used to model the properties of water. Albeit maintaining a good balance between accuracy and efficiency, no density functional has so far achieved the degree of accuracy necessary to correctly predict the properties of water across the entire phase diagram. Here, we present density-corrected SCAN (DC-SCAN) calculations for water which, minimizing density-driven errors, elevate the accuracy of the SCAN functional to that of “gold standard” coupled-cluster theory. Building upon the accuracy of DC-SCAN within a many-body formalism, we introduce a data-driven many-body potential energy function, MB-SCAN(DC), that quantitatively reproduces coupled cluster reference values for interaction, binding, and individual many-body energies of water clusters. Importantly, molecular dynamics simulations carried out with MB-SCAN(DC) also reproduce the properties of liquid water, which thus demonstrates that MB-SCAN(DC) is effectively the first DFT-based model that correctly describes water from the gas to the liquid phase.

Save Icon
Up Arrow
Open/Close