Kinase structures in the inactive “DFG‐out” state provide a wealth of druggable binding site variants. The conformational plasticity of this state can be mainly described by different conformations of binding site‐forming elements such as DFG motif, A‐loop, P‐loop, and αC‐helix. Compared to DFG‐in structures, DFG‐out structures are largely underrepresented in the Protein Data Bank (PDB). Thus, structure‐based drug design efforts for DFG‐out inhibitors may benefit from an efficient approach to generate an ensemble of DFG‐out structures. Accordingly, the presented modeling pipeline systematically generates homology models of kinases in several DFG‐out conformations based on a sophisticated creation of template structures that represent the major states of the flexible structural elements. Eighteen template classes were initially selected from all available kinase structures in the PDB and subsequently employed for modeling the entire kinome in different DFG‐out variants by fusing individual structural elements to multiple chimeric template structures. Molecular dynamics simulations revealed that conformational transitions between the different DFG‐out states generally do not occur within trajectories of a few hundred nanoseconds length. This underlines the benefits of the presented homology modeling pipeline to generate relevant conformations of “DFG‐out” kinase structures for subsequent in silico screening or binding site analysis studies.
Motivation Co-evolution analysis can be used to accurately predict residue-residue contacts from multiple sequence alignments. The introduction of machine-learning techniques has enabled substantial improvements in precision and a shift from predicting binary contacts to predicting distances between pairs of residues. These developments have significantly improved the accuracy of de novo prediction of static protein structures. With AlphaFold2 lifting the accuracy of some predicted protein models close to experimental levels, structure prediction research will move on to other challenges. One of those areas is the prediction of more than one conformation of a protein. Here we examine the potential of residue-residue distance predictions to be informative of protein flexibility rather than simply static structure. Results We used DMPfold to predict distance distributions for every residue pair in a set of proteins that showed both rigid and flexible behaviour. Residue pairs that were in contact in at least one reference structure were classified as rigid, flexible or neither. The predicted distance distribution of each residue pair was analysed for local maxima of probability indicating the most likely distance or distances between a pair of residues. We found that rigid residue pairs tended to have only a single local maximum in their predicted distance distributions while flexible residue pairs more often had multiple local maxima. These results suggest that the shape of predicted distance distributions contains information on the rigidity or flexibility of a protein and its constituent residues. Supplementary information Supplementary data are available at Bioinformatics online.
The GroE chaperonin system, which comprises GroEL and GroES, assists protein folding in vivo and in vitro. It is conserved in all prokaryotes except in most, but not all, members of the class of mollicutes. In Escherichia coli, about 60 proteins were found to be obligatory clients of the GroE system. Here, we describe the properties of the homologs of these GroE clients in mollicutes and the evolution of chaperonins in this class of bacteria. Comparing the properties of these homologs in mollicutes with and without chaperonins enabled us to search for features correlated with the presence of GroE. Interestingly, no sequence-based features of proteins such as average length, amino acid composition and predicted folding/disorder propensity were found to be affected by the absence of GroE. Other properties such as genome size and number of proteins were also found to not differ between mollicute species with and without GroE. Our data suggest that two clades of mollicutes re-acquired the GroE system, thereby supporting the view that gaining the system occurred polyphyletically and not monophyletically, as previously debated. Our data also suggest that there might have been three isolated cases of lateral gene transfer from specific bacterial sources. Taken together, our data indicate that loss of GroE does not involve crossing a high evolutionary barrier and can be compensated for by a small number of changes within the few dozen client proteins.
Co-evolution analysis can be used to accurately predict residue-residue contacts from multiple sequence alignments. The introduction of machine-learning techniques has enabled substantial improvements in precision and a shift from predicting binary contacts to predicting distances between pairs of residues. These developments have significantly improved the accuracy of de novo prediction of static protein structures. Here we examine the potential of these residue-residue distance predictions to predict protein flexibility rather than static structure. We used DMPfold to predict distance distributions for every residue pair in a set of proteins that showed both rigid and flexible behaviour. Residue pairs that were in contact in at least one reference structure were considered and classified as rigid, flexible or neither. The predicted distance distribution of each residue pair was analysed for local maxima of probability indicating the most likely distance or distances between a pair of residues. The average number of local maxima per residue pair was found to be different between the sets of rigid and flexible residue pairs. Flexible residue pairs more often had multiple local maxima in their predicted distance distribution than rigid residue pairs suggesting that the shape of predicted distance distributions is predictive of rigidity or flexibility of residue pairs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.