We present a general framework for the development of datadriven many-body (MB) potential energy functions (MB-QM PEFs) that represent the interactions between small molecules at an arbitrary quantummechanical (QM) level of theory. As a demonstration, a family of MB-QM PEFs for water is rigorously derived from density functionals belonging to different rungs across Jacob's ladder of approximations within density functional theory (MB-DFT) and from Møller−Plesset perturbation theory (MB-MP2). Through a systematic analysis of individual MB contributions to the interaction energies of water clusters, we demonstrate that all MB-QM PEFs preserve the same accuracy as the corresponding ab initio calculations, with the exception of those derived from density functionals within the generalized gradient approximation (GGA). The differences between the DFT and MB-DFT results are traced back to density-driven errors that prevent GGA functionals from accurately representing the underlying molecular interactions for different cluster sizes and hydrogen-bonding arrangements. We show that this shortcoming may be overcome, within the MB formalism, by using density-corrected functionals (DC-DFT) that provide a more consistent representation of each individual MB contribution. This is demonstrated through the development of a MB-DFT PEF derived from DC-PBE-D3 data, which more accurately reproduce the corresponding ab initio results.
We investigate the interplay between functional-driven and density-driven errors in different density functional approximations within density functional theory (DFT) and the implications of these errors for simulations of water with DFT-based data-driven potentials. Specifically, we quantify density-driven errors in two widely used dispersion-corrected functionals derived within the generalized gradient approximation (GGA), namely BLYP-D3 and revPBE-D3, and two modern meta-GGA functionals, namely strongly constrained and appropriately normed (SCAN) and B97M-rV. The effects of functional-driven and density-driven errors on the interaction energies are first assessed for the water clusters of the BEGDB dataset. Further insights into the nature of functional-driven errors are gained from applying the absolutely localized molecular orbital energy decomposition analysis (ALMO-EDA) to the interaction energies, which demonstrates that functional-driven errors are strongly correlated with the nature of the interactions. We discuss cases where density-corrected DFT (DC-DFT) models display higher accuracy than the original DFT models and cases where reducing the density-driven errors leads to larger deviations from the reference energies due to the presence of large functional-driven errors. Finally, molecular dynamics simulations are performed with data-driven many-body potentials derived from DFT and DC-DFT data to determine the effect that minimizing density-driven errors has on the description of liquid water. Besides rationalizing the performance of widely used DFT models of water, we believe that our findings unveil fundamental relations between the shortcomings of some common DFT approximations and the requirements for accurate descriptions of molecular interactions, which will aid the development of a consistent, DFT-based framework for the development of data-driven and machine-learned potentials for simulations of condensed-phase systems.
<div> <div> <div> <p> </p><div> <div> <div> <p>We present a general framework for the development of data-driven many-body (MB) potential energy functions (MB-QM PEFs) that represent the interactions between small molecules at an arbitrary quantum-mechanical (QM) level of theory. As a demonstration, a family of MB-QM PEFs for water are rigorously derived from density functionals belonging to differ- ent rungs across Jacob’s ladder of approximations within density functional theory (MB-DFT) as well as from Møller-Plesset perturbation theory (MB-MP2). Through a systematic analysis of individual many-body contributions to the interaction energies of water clusters, we demonstrate that all MB-QM PEFs preserve the same accuracy as the corresponding ab initio calculations, with the exception of those derived from density functionals within the generalized gradient approximation (GGA). The differences between the DFT and MB-DFT results are traced back to density-driven errors that prevent GGA functionals from accurately representing the underlying molecular interactions for different cluster sizes and hydrogen-bonding arrangements. We show that this shortcoming may be overcome, within the many-body formalism, by using density-corrected functionals that provide a more consistent representation of each individual many-body contribution. This is demonstrated through the development of a MB-DFT PEF derived from density-corrected PBE-D3 data, which more accurately reproduce the corresponding ab initio results. </p> </div> </div> </div> </div> </div> </div>
We investigate the interplay between functional-driven and density-driven errors in different density functional theory (DFT) approximations, and the implications of these errors for simulations of water with DFT-based data-driven many-body potentials. Specifically, we quantify density-driven errors in two widely used dispersion-corrected functionals derived within the generalized gradient approximation (GGA), namely BLYP-D3 and revPBE-D3, and two modern meta-GGA functionals, namely SCAN and B97M-rV. The effects of functional-driven and density-driven errors on the interaction energies are assessed for the water clusters of the BEGDB dataset. Further insight into the nature of functional-driven errors is gained from applying the absolutely localized molecular orbital energy decomposition analysis (ALMO- EDA) to the interaction energies, which demonstrates that functional-driven errors are strongly correlated with the nature of the interactions. We discuss cases where density-corrected DFT (DC-DFT) models display higher accuracy than the original DFT models, and cases where reducing the density-driven errors leads to larger deviations from the reference energies due to the presence of large functional-driven errors. Finally, molecular dynamics simulations are performed with data-driven many-body potentials derived from DFT and DC-DFT data to determine the effect that minimizing density-driven errors has on the description of liquid water. Besides rationalizing the performance of widely used DFT models of water, we believe that our findings unveil fundamental relations between the shortcomings of some common DFT approximations and the requirements for accurate descriptions of molecular interactions, which will aid the development of a consistent, DFT-based framework for data-driven simulations of condensed-phase systems.
<div> <div> <div> <p> </p><div> <div> <div> <p>We present a general framework for the development of data-driven many-body (MB) potential energy functions (MB-QM PEFs) that represent the interactions between small molecules at an arbitrary quantum-mechanical (QM) level of theory. As a demonstration, a family of MB-QM PEFs for water are rigorously derived from density functionals belonging to differ- ent rungs across Jacob’s ladder of approximations within density functional theory (MB-DFT) as well as from Møller-Plesset perturbation theory (MB-MP2). Through a systematic analysis of individual many-body contributions to the interaction energies of water clusters, we demonstrate that all MB-QM PEFs preserve the same accuracy as the corresponding ab initio calculations, with the exception of those derived from density functionals within the generalized gradient approximation (GGA). The differences between the DFT and MB-DFT results are traced back to density-driven errors that prevent GGA functionals from accurately representing the underlying molecular interactions for different cluster sizes and hydrogen-bonding arrangements. We show that this shortcoming may be overcome, within the many-body formalism, by using density-corrected functionals that provide a more consistent representation of each individual many-body contribution. This is demonstrated through the development of a MB-DFT PEF derived from density-corrected PBE-D3 data, which more accurately reproduce the corresponding ab initio results. </p> </div> </div> </div> </div> </div> </div>
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.