The ankyrin repeat is one of the most frequently observed amino acid motifs in protein databases. This protein-protein interaction module is involved in a diverse set of cellular functions, and consequently, defects in ankyrin repeat proteins have been found in a number of human diseases. Recent biophysical, crystallographic, and NMR studies have been used to measure the stability and define the various topological features of this motif in an effort to understand the structural basis of ankyrin repeat-mediated protein-protein interactions. Characterization of the folding and assembly pathways suggests that ankyrin repeat domains generally undergo a two-state folding transition despite their modular structure. Also, the large number of available sequences has allowed the ankyrin repeat to be used as a template for consensusbased protein design. Such projects have been successful in revealing positions responsible for structure and function in the ankyrin repeat as well as creating a potential universal scaffold for molecular recognition.
The ankyrin repeat is one of the most common, modular, proteinprotein interaction motifs in nature. To understand the structural determinants of this family of proteins and extract the consensus information that defines the architecture of this motif, we have designed a series of idealized ankyrin repeat proteins containing one, two, three, or four repeats by using statistical analysis of Ϸ4,000 ankyrin repeat sequences from the PFAM database. Biophysical and x-ray crystallographic studies of the three and four repeat constructs (3ANK and 4ANK) to 1.26 and 1.5 Å resolution, respectively, demonstrate that these proteins are well-folded, monomeric, display high thermostability, and adopt a very regular, tightly packed ankyrin repeat fold. Mapping the degree of amino acid conservation at each position on the 4ANK structure shows that most nonconserved residues are clustered on the surface of the molecule that has been designated as the binding site in naturally occurring ankyrin repeat proteins. Thus, the consensus amino acid sequence contains all information required to define the ankyrin repeat fold. Our results suggest that statistical analysis and the consensus sequence approach can be used as an effective method to design proteins with complex topologies. These generic ankyrin repeat proteins can serve as prototypes for dissecting the rules of molecular recognition mediated by ankyrin repeats and for engineering proteins with novel biological functions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.