Combinatorial mutagenesis with an alphabet limited to alanine, glutamic acid, lysine, and threonine was used to probe the role of interactions involving surface residues in stabilizing a short a-helical coiled coil. The residues at eight e and g positions in the leucine zipper of the Saccharomyces cerevisiue transcription factor GCN4 were randomized to these four residues in a h repressor-leucine zipper fusion protein, resulting in 65,536 possible residue combinations. Roughly three-fourths of these combinations allowed stable coiled-coil formation as assayed by DNA binding by the fusion protein. To understand the basis for the activity differences, functional and nonfunctional mutants were sequenced and statistical tests were applied to identify structure/function correlations. Helix-forming propensity and favorable intrasubunit and intersubunit charge-charge interactions were positively correlated with activity. These studies suggest that the identities of surface side chains at the e and g positions of coiled coils contribute modestly to stability; by comparison with previous work, however, the e and g positions are far less critical than residues at the a and d positions, which form the hydrophobic core of the dimer interface.
BackgroundInteractions that involve one or more amino acid side chains near the ends of protein helices stabilize helix termini and shape the geometry of the adjacent loops, making a substantial contribution to overall protein structure. Previous work has identified key helix-terminal motifs, such as Asx/ST N-caps, the capping box, and hydrophobic and electrostatic interactions, but important questions remain, including: 1) What loop backbone geometries are favoured by each motif? 2) To what extent are multi-amino acid motifs likely to represent genuine cooperative interactions? 3) Can new motifs be identified in a large, recent dataset using the latest bioinformatics tools?ResultsThree analytical tools are applied here to answer these questions. First, helix-terminal structures are partitioned by loop backbone geometry using a new 3D clustering algorithm. Next, Cascade Detection, a motif detection algorithm recently published by the author, is applied to each cluster to determine which sequence motifs are overrepresented in each geometry. Finally, the results for each motif are presented in a CapMap, a 3D conformational heatmap that displays the distribution of the motif’s overrepresentation across loop geometries, enabling the rapid isolation and characterization of the associated side chain interaction. This work identifies a library of geometry-specific side chain interactions that provides a new, detailed picture of loop structure near the helix terminus. Highlights include determinations of the favoured loop geometries for the Asx/ST N-cap motifs, capping boxes, “big” boxes, and other hydrophobic, electrostatic, H-bond, and pi stacking interactions, many of which have not been described before.ConclusionsThis work demonstrates that the combination of structural clustering and motif detection in the sequence space can efficiently identify side chain motifs and map them to the loop geometries which they support. Protein designers should find this study useful, because it identifies side chain interactions which are good candidates for inclusion in synthetic helix-terminal loops with specific desired geometries, since they are used in nature to support these geometries. The techniques described here can also be applied to map side chain interactions associated with other structural components of proteins such as beta and gamma turns.Electronic supplementary materialThe online version of this article (doi:10.1186/s12859-015-0671-4) contains supplementary material, which is available to authorized users.
Supplementary information is available at Bioinformatics online.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.