Repeat proteins are a special class of modular, non-globular proteins composed of small structural motifs arrayed to form elongated architectures and stabilised solely by short-range contacts. We find a remarkable complexity in the unfolding of the large HEAT repeat protein PR65/A. In contrast to what has been seen for small repeat proteins, in which unfolding propagates from one end, the HEAT array of PR65/A ruptures at multiple distant sites, leading to intermediate states with non-contiguous folded subdomains. Kinetic analysis allows us to define a network of intermediates and to delineate the pathways that connect them. There is a dominant sequence of unfolding, reflecting a non-uniform distribution of stability across the repeat array; however, the unfolding of certain intermediates is competitive, leading to parallel pathways. Theoretical models accounting for the heterogeneous contact density in the folded structure are able to rationalize the variation in stability across the array. This variation in stability also suggests how folding may direct function in a large repeat protein: the stability distribution enables certain regions to present rigid motifs for molecular recognition while affording others flexibility to broaden the search area, as in a fly-casting mechanism. Thus PR65/A uses the two ends of the repeat array to bind diverse partners and thereby coordinate the dephosphorylation of many different substrates and of multiple sites within hyperphosphorylated substrates.
Knots in proteins are hypothesized to make them resistant to enzymatic degradation by ATP-dependent proteases, and recent studies have shown that whereas ClpXP can easily degrade a protein with a shallow 3₁ knot, it cannot degrade 5₂-knotted proteins if degradation is initiated at the C-terminus. Here, we present detailed studies of the degradation of both 3₁- and 5₂-knotted proteins by ClpXP using numerous constructs where proteins are tagged for degradation at both N- and C-termini. Our results confirm and extend earlier work and show that ClpXP can easily degrade a deeply 3₁-knotted protein. In contrast to recently published work on the degradation of 5₂-knotted proteins, our results show that the ClpXP machinery can also easily degrade these proteins. However, the degradation depends critically on the location of the degradation tag and the local stability near the tag. Our results are consistent with mechanisms in which either the knot simply slips along the polypeptide chain and falls off the free terminus, or the tightened knot enters the translocation pore of ClpXP. Results of experiments on knotted protein fusions with a highly stable domain show partial degradation and the formation of degradation intermediates.
Protein energy landscapes are highly complex, yet the vast majority of states within them tend to be invisible to experimentalists. Here, using site-directed mutagenesis and exploiting the simplicity of tandem-repeat protein structures, we delineate a network of these states and the routes between them. We show that our target, gankyrin, a 226-residue 7-ankyrin-repeat protein, can access two alternative (un)folding pathways. We resolve intermediates as well as transition states, constituting a comprehensive series of snapshots that map early and late stages of the two pathways and show both to be polarized such that the repeat array progressively unravels from one end of the molecule or the other. Strikingly, we find that the protein folds via one pathway but unfolds via a different one. The origins of this behavior can be rationalized using the numerical results of a simple statistical mechanics model that allows us to visualize the equilibrium behavior as well as single-molecule folding/unfolding trajectories, thereby filling in the gaps that are not accessible to direct experimental observation. Our study highlights the complexity of repeat-protein folding arising from their symmetrical structures; at the same time, however, this structural simplicity enables us to dissect the complexity and thereby map the precise topography of the energy landscape in full breadth and remarkable detail. That we can recapitulate the key features of the folding mechanism by computational analysis of the native structure alone will help towards the ultimate goal of designed amino-acid sequences with made-to-measure folding mechanisms, the Holy Grail of protein folding.
Studying protein folding and protein design in globular proteins presents significant challenges because of two related features, topological complexity and co-operativity. In contrast, tandem-repeat proteins have regular and modular structures composed of linearly arrayed motifs. This means that the biophysics of even giant repeat proteins is highly amenable to dissection and to rational design. Here we discuss what has been learnt about the folding mechanisms of tandem-repeat proteins. The defining features that have emerged are: (i) multiple distinct routes are accessible between denatured and native states, both at equilibrium and under kinetic conditions; (ii) different routes are favoured for folding compared with unfolding; (iii) unfolding energy barriers are broad, reflecting stepwise unravelling of an array repeat by repeat; (iv) unfolding at equilibrium is highly co-operative, and exceptionally high thermodynamic stabilities can be reached by introducing consensus residues; (v) under force, helical-repeat structures are very weak, with non-co-operative unfolding leading to elasticity and buffering effects. This level of understanding should enable us to create repeat proteins with made-to-measure folding mechanisms, in which one can dial into the sequence the order of repeat folding, number of pathways taken, step size (co-operativity) and fine-structure of the kinetic energy barriers.