The suicide inhibitory mechanism of serine protease inhibitors of the serpin superfamily depends heavily on their structural flexibility, which is controlled in large part by the breach and shutter regions of the central A-sheet. We examined codon usage by the highly conserved residues, Ser-53 and Ser-56, of the shutter region and found a TCN-AGY usage dichotomy for Ser-56 that remarkably is linked to the protostome-deuterostome split. Our results suggest that serpin evolution was driven by phylogenetic speciation and not pressure to fulfill new physiologic functions mitigating against coevolution with the family of serine proteases they inhibit.The serpin superfamily features proteins that participate in a wide variety of physiologic and cellular functions ranging from blood clotting to molecular chaperoning (1). The functional mechanism of the majority of serpins is suicide inhibition of serine proteases of clan PA, the chymotrypsin-like clan (2). Serpins have a diverse species distribution with members of the superfamily found in plants, viruses, archaea, nematodes, arthropods, and higher animals. Thus, serpins are similar to the class of proteases they inhibit in terms of their functional and species diversity. Unlike chymotrypsin-like proteases, however, inhibitory serpins require a high degree of conformational flexibility for normal function (3). The inhibitor must be able to achieve a 70-Å translocation of the protease that is accompanied by significant structural changes (4). Disruption of this structural flexibility has been linked to a variety of disease states characterized by protease-antiprotease imbalance (5). The key role of conformational flexibility raises the possibility that the evolutionary history of the serpins is connected to regions of the molecule that are associated with maintaining flexibility.Discrete evolutionary markers, in the form of conserved amino acid residues that also display dichotomous sequence choices, have been used to reconstruct the evolutionary history of clan PA serine proteases (6). Such markers can also be identified for the serpin superfamily to obtain phylogenetic information that extends beyond measurement of sequence similarity. The information obtained from those markers could reveal whether the evolution of serpins correlates with that of the proteases they inhibit. Here we search for discrete phylogenetic markers to address the roles of structural flexibility and coevolution with proteases in the evolutionary history of the serpin superfamily.
MATERIALS AND METHODSAmino acid and nucleotide sequences of Ͼ200 serpins were culled from GenBank TM at www.ncbi.nlm.nih.gov/Entrez. Genome sequences were translated and then aligned using ClustalW (7) to identify the codons and amino acid choices for residues of interest. For calculation of protein distance matrices, nearly identical sequences were eliminated from alignments to minimize duplication. Non-inhibitory members of the serpin superfamily were also eliminated, leaving a pool of 74 serpins as the basis for su...