In this study, we have revised the rules and parameters for one of the most commonly used empirical pKa predictors, PROPKA, based on better physical description of the desolvation and dielectric response for the protein. We have introduced a new and consistent approach to interpolate the description between the previously distinct classifications into internal and surface residues, which otherwise is found to give rise to an erratic and discontinuous behavior. Since the goal of this study is to lay out the framework and validate the concept, it focuses on Asp and Glu residues where the protein pKa values and structures are assumed to be more reliable. The new and improved implementation is evaluated and discussed; it is found to agree better with experiment than the previous implementation (in parentheses): rmsd = 0.79 (0.91) for Asp and Glu, 0.75 (0.97) for Tyr, 0.65 (0.72) for Lys, and 1.00 (1.37) for His residues. The most significant advance, however, is in reducing the number of outliers and removing unreasonable sensitivity to small structural changes that arise from classifying residues as either internal or surface.
The new empirical rules for protein pKa predictions implemented in the PROPKA3.0 software package (Olsson et al. J. Chem. Theory Comput.2010, 7, 525-537) have been extended to the prediction of pKa shifts of active site residues and ionizable ligand groups in protein-ligand complexes. We present new algorithms that allow pKa shifts due to inductive (i.e., covalently coupled) intraligand interactions, as well as noncovalently coupled interligand interactions in multiligand complexes, to be included in the prediction. The number of different ligand chemical groups that are automatically recognized has been increased to 18, and the general implementation has been changed so that new functional groups can be added easily by the user, aided by a new and more general protonation scheme. Except for a few cases, the new algorithms in PROPKA3.1 are found to yield results similar to or better than those obtained with PROPKA2.0 (Bas et al. Proteins: Struct., Funct., Bioinf.2008, 73, 765-783). Finally, we present a novel algorithm that identifies noncovalently coupled ionizable groups, where pKa prediction may be especially difficult. This is a general improvement to PROPKA and is applied to proteins with and without ligands.
Background: Charge states of ionizable residues in proteins determine their pH-dependent properties through their pK a values. Thus, various theoretical methods to determine ionization constants of residues in biological systems have been developed. One of the more widely used approaches for predicting pK a values in proteins is the PROPKA program, which provides convenient structural rationalization of the predicted pK a values without any additional calculations. Results: The PROPKA Graphical User Interface (GUI) is a new tool for studying the pH-dependent properties of proteins such as charge and stabilization energy. It facilitates a quantitative analysis of pK a values of ionizable residues together with their structural determinants by providing a direct link between the pK a data, predicted by the PROPKA calculations, and the structure via the Visual Molecular Dynamics (VMD) program. The GUI also calculates contributions to the pH-dependent unfolding free energy at a given pH for each ionizable group in the protein. Moreover, the PROPKA-computed pK a values or energy contributions of the ionizable residues in question can be displayed interactively. The PROPKA GUI can also be used for comparing pH-dependent properties of more than one structure at the same time. Conclusions: The GUI considerably extends the analysis and validation possibilities of the PROPKA approach. The PROPKA GUI can conveniently be used to investigate ionizable groups, and their interactions, of residues with significantly perturbed pK a values or residues that contribute to the stabilization energy the most. Chargedependent properties can be studied either for a single protein or simultaneously with other homologous structures, which makes it a helpful tool, for instance, in protein design studies or structure-based function predictions. The GUI is implemented as a Tcl/Tk plug-in for VMD, and can be obtained online at
Site-specific pK(a) values measured by NMR spectroscopy provide essential information on protein electrostatics, the pH-dependence of protein structure, dynamics and function, and constitute an important benchmark for protein pK(a) calculation algorithms. Titration curves can be measured by tracking the NMR chemical shifts of several reporter nuclei versus sample pH. However, careful analysis of these curves is needed to extract residue-specific pK(a) values since pH-dependent chemical shift changes can arise from many sources, including through-bond inductive effects, through-space electric field effects, and conformational changes. We have re-measured titration curves for all carboxylates and His 15 in Hen Egg White Lysozyme (HEWL) by recording the pH-dependent chemical shifts of all backbone amide nitrogens and protons, Asp/Glu side chain protons and carboxyl carbons, and imidazole protonated carbons and protons in this protein. We extracted pK(a) values from the resulting titration curves using standard fitting methods, and compared these values to each other, and with those measured previously by ¹H NMR (Bartik et al., Biophys J 1994;66:1180–1184). This analysis gives insights into the true accuracy associated with experimentally measured pK(a) values. We find that apparent pK(a) values frequently differ by 0.5–1.0 units depending upon the nuclei monitored, and that larger differences occasionally can be observed. The variation in measured pK(a) values, which reflects the difficulty in fitting and assigning pH-dependent chemical shifts to specific ionization equilibria, has significant implications for the experimental procedures used for measuring protein pK(a) values, for the benchmarking of protein pK(a) calculation algorithms, and for the understanding of protein electrostatics in general.
The development of docking scoring functions requires high-resolution 3D structures of protein-ligand complexes for which the binding affinity of the ligand has been measured experimentally. Protein-ligand binding affinities are measured in solution experiments, and high resolution protein-ligand structures can be determined only by X-ray crystallography. Protein-ligand scoring functions must therefore reproduce solution binding energies using analyses of proteins in a crystal environment. We present an analysis of the prevalence of crystal-induced artifacts and water-mediated contacts in protein-ligand complexes and demonstrate the effect that these can have on the performance of protein-ligand scoring functions. We find 36% of ligands in the PDBBind 2007 refined data set to be influenced by crystal contacts and find the performance of a scoring function to be affected by these. A Web server for detecting crystal contacts in protein-ligand complexes is available at http://enzyme.ucd.ie/LIGCRYST .
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.