Hydrogen bonding is a key contributor to the exquisite specificity of the interactions within and between biological macromolecules, and hence accurate modeling of such interactions requires an accurate description of hydrogen bonding energetics. Here we investigate the orientation and distance dependence of hydrogen bonding energetics by combining two quite disparate but complementary approaches: quantum mechanical electronic structure calculations and protein structural analysis. We find a remarkable agreement between the energy landscapes obtained from the electronic structure calculations and the distributions of hydrogen bond geometries observed in protein structures. In contrast, molecular mechanics force fields commonly used for biomolecular simulations do not consistently exhibit close correspondence to either quantum mechanical calculations or experimentally observed hydrogen bonding geometries. These results suggest a route to improved energy functions for biological macromolecules that combines the generality of quantum mechanical electronic structure calculations with the accurate context dependence implicit in protein structural analysis.
Hydrogen bonds are partially covalent interactions between a hydrogen atom covalently bound to an electronegative atom and an electronegative acceptor atom (1). They play an important role in defining the structure and function of biological macromolecular systems and contribute to the specificity of molecular recognition (2), the formation of secondary structures (3), and the energetics of protein folding (4). Numerous studies of experimentally available protein and small molecule structures have revealed the directional character of hydrogen bonds and in particular the nonlinear geometry at the acceptor atom (5-8). Computational modeling of hydrogen bonding energy landscapes is a challenging problem; current approaches include quantum mechanical calculations on model systems (usually small molecules analogous to either a main-chain peptide unit or an amino acid side chain) (9, 10), molecular mechanics (force field) approaches (11-13), and knowledge-based potentials derived from small molecule structure databases (14) or the Protein Data Bank (PDB) (15,16).For application to macromolecular systems, different approaches have complementary strengths and weaknesses. Quantum mechanical electronic structure calculations are clearly the most fundamental and general but can be carried out at a rigorous level of theory only for systems much smaller than biological macromolecules, and the results are not necessarily transferable to the complex macromolecular environment. Empirical molecular mechanics force fields are constructed to apply generally to macromolecules and so are not subject to the size limitations of electronic structure calculations, but their accuracy may be limited by the (necessary) simplification of the quantum mechanical system and the large number of parameters that need to be obtained by fitting against experimental data. Inference of interaction ener...