The state-of-the-art to assess the structural quality of docking models is currently based on three related yet independent quality measures: Fnat, LRMS, and iRMS as proposed and standardized by CAPRI. These quality measures quantify different aspects of the quality of a particular docking model and need to be viewed together to reveal the true quality, e.g. a model with relatively poor LRMS (>10Å) might still qualify as 'acceptable' with a descent Fnat (>0.50) and iRMS (<3.0Å). This is also the reason why the so called CAPRI criteria for assessing the quality of docking models is defined by applying various ad-hoc cutoffs on these measures to classify a docking model into the four classes: Incorrect, Acceptable, Medium, or High quality. This classification has been useful in CAPRI, but since models are grouped in only four bins it is also rather limiting, making it difficult to rank models, correlate with scoring functions or use it as target function in machine learning algorithms. Here, we present DockQ, a continuous protein-protein docking model quality measure derived by combining Fnat, LRMS, and iRMS to a single score in the range [0, 1] that can be used to assess the quality of protein docking models. By using DockQ on CAPRI models it is possible to almost completely reproduce the original CAPRI classification into Incorrect, Acceptable, Medium and High quality. An average PPV of 94% at 90% Recall demonstrating that there is no need to apply predefined ad-hoc cutoffs to classify docking models. Since DockQ recapitulates the CAPRI classification almost perfectly, it can be viewed as a higher resolution version of the CAPRI classification, making it possible to estimate model quality in a more quantitative way using Z-scores or sum of top ranked models, which has been so valuable for the CASP community. The possibility to directly correlate a quality measure to a scoring function has been crucial for the development of scoring functions for protein structure prediction, and DockQ should be useful in a similar development in the protein docking field. DockQ is available at http://github.com/bjornwallner/DockQ/
Intrinsically Disordered Proteins (IDPs) are enriched in charged and polar residues; and, therefore, electrostatic interactions play a predominant role in their dynamics. In order to remain multi-functional and exhibit their characteristic binding promiscuity, they need to retain considerable dynamic flexibility. At the same time, they also need to accommodate a large number of oppositely charged residues, which eventually lead to the formation of salt-bridges, imparting local rigidity. The formation of salt-bridges therefore oppose the desired dynamic flexibility. Hence, there appears to be a meticulous trade-off between the two mechanisms which the current study attempts to unravel. With this objective, we identify and analyze salt-bridges, both as isolated as well as composite ionic bond motifs, in the molecular dynamic trajectories of a set of appropriately chosen IDPs. Time evolved structural properties of these salt-bridges like persistence, associated secondary structural 'orderdisorder' transitions, correlated atomic movements, contribution in the overall electrostatic balance of the proteins have been studied in necessary detail. The results suggest that the key to maintain such a trade-off over time is the continuous formation and dissolution of salt-bridges with a wide range of persistence. Also, the continuous dynamic interchange of charged-atom-pairs (coming from a variety of oppositely charged side-chains) in the transient ionic bonds supports a model of dynamic flexibility concomitant with the well characterized stochastic conformational switching in these proteins. The results and conclusions should facilitate the future design of salt-bridges as a mean to further explore the disordered-globular interface in proteins.
Complementarity, in terms of both shape and electrostatic potential, has been quantitatively estimated at proteinprotein interfaces and used extensively to predict the specific geometry of association between interacting proteins. In this work, we attempted to place both binding and folding on a common conceptual platform based on complementarity. To that end, we estimated (for the first time to our knowledge) electrostatic complementarity (E m) for residues buried within proteins. E m measures the correlation of surface electrostatic potential at protein interiors. The results show fairly uniform and significant values for all amino acids. Interestingly, hydrophobic side chains also attain appreciable complementarity primarily due to the trajectory of the main chain. Previous work from our laboratory characterized the surface (or shape) complementarity (S m) of interior residues, and both of these measures have now been combined to derive two scoring functions to identify the native fold amid a set of decoys. These scoring functions are somewhat similar to functions that discriminate among multiple solutions in a protein-protein docking exercise. The performances of both of these functions on state-of-the-art databases were comparable if not better than most currently available scoring functions. Thus, analogously to interfacial residues of protein chains associated (docked) with specific geometry, amino acids found in the native interior have to satisfy fairly stringent constraints in terms of both S m and E m. The functions were also found to be useful for correctly identifying the same fold for two sequences with low sequence identity. Finally, inspired by the Ramachandran plot, we developed a plot of S m versus E m (referred to as the complementarity plot) that identifies residues with suboptimal packing and electrostatics which appear to be correlated to coordinate errors.
The DelPhiPKa is a widely used and unique approach to compute pKa’s of ionizable groups that does not require molecular surface to be defined. Instead, it uses smooth Gaussian-based dielectric function to treat computational space via Poisson-Boltzmann equation (PBE). Here we report an expansion of DelPhiPKa functionality to enable inclusion of salt in the modeling protocol. The method considers the salt mobile ions in solvent phase without defining solute-solvent boundary. Instead, the ions are penalized to enter solute interior via a desolvation penalty term in the Boltzmann factor in the framework of PBE. Hence, the concentration of ions near to protein is balanced by the desolvation penalty and electrostatic interactions. The study reveals that correlation between experimental and calculated pKa’s is improved significantly by taking into consideration the presence of salt. Furthermore, it is demonstrated that DelphiPKa reproduces the salt sensitivity of experimentally measured pKa’s. Another new development of DelPhiPKa allows for computing the pKa’s of polar residues such as cysteine, serine, threonine and tyrosine. With this regard, DelPhiPKa is benchmarked against experimentally measured cysteine and tyrosine pKa’s and for cysteine it is shown to outperform other existing methods (DelPhiPKa RMSD of 1.73 versus RMSD between 2.40 and 4.72 obtained by other existing pKa prediction methods).
BackgroundMapping protein primary sequences to their three dimensional folds referred to as the 'second genetic code' remains an unsolved scientific problem. A crucial part of the problem concerns the geometrical specificity in side chain association leading to densely packed protein cores, a hallmark of correctly folded native structures. Thus, any model of packing within proteins should constitute an indispensable component of protein folding and design.ResultsIn this study an attempt has been made to find, characterize and classify recurring patterns in the packing of side chain atoms within a protein which sustains its native fold. The interaction of side chain atoms within the protein core has been represented as a contact network based on the surface complementarity and overlap between associating side chain surfaces. Some network topologies definitely appear to be preferred and they have been termed 'packing motifs', analogous to super secondary structures in proteins. Study of the distribution of these motifs reveals the ubiquitous presence of typical smaller graphs, which appear to get linked or coalesce to give larger graphs, reminiscent of the nucleation-condensation model in protein folding. One such frequently occurring motif, also envisaged as the unit of clustering, the three residue clique was invariably found in regions of dense packing. Finally, topological measures based on surface contact networks appeared to be effective in discriminating sequences native to a specific fold amongst a set of decoys.ConclusionsOut of innumerable topological possibilities, only a finite number of specific packing motifs are actually realized in proteins. This small number of motifs could serve as a basis set in the construction of larger networks. Of these, the triplet clique exhibits distinct preference both in terms of composition and geometry.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.