Inteins are mobile genetic elements capable of self-splicing post-translationally. They exist in all three domains of life including in viruses and bacteriophage, where they have a sporadic distribution even among very closely related species. In this review, we address this anomalous distribution from the point of view of the evolution of the host species as well as the intrinsic features of the inteins that contribute to their genetic mobility. We also discuss the incidence of inteins in functionally important sites of their host proteins. Finally, we describe instances of conditional protein splicing. These latter observations lead us to the hypothesis that some inteins have adapted to become sensors that play regulatory roles within their host protein, to the advantage of the organism in which they reside.
Protein SplicingProtein splicing is a naturally occurring biochemical process that mediates the post-translational conversion of a precursor polypeptide into a mature and functional protein through the removal of an internal protein element, called an intein (Fig. 1A). The process is analogous to intron splicing at the RNA level. The protein splicing mechanism involves a series of autocatalytic peptide bond rearrangements, where the intein excises itself from the precursor polypeptide with concurrent ligation of the flanking sequences, called exteins (N-or C-exteins relative to the position of intein) (for review, see Refs. 1-3). As a result of this process, two proteins are produced from a single polypeptide product. The term intein refers to both the genetic element in the DNA or RNA and the protein splicing entity.Most inteins are expressed within a single polypeptide chain (cis-splicing inteins), but some are split into two polypeptides each containing one extein and an intein fragment (trans-splicing inteins) (4, 5). In the case of split inteins, reassociation of the fragments at a zipper-like interface (6) precedes protein splicing (Fig. 1B). Both cis-splicing and trans-splicing inteins are frequently utilized in various biotechnological applications including protein purification, modification, labeling, and posttranslational control of expressed proteins (reviewed in Refs. 7 and 8). The cis-splicing inteins often contain a distinct homing endonuclease (HEN) 2 domain (9). HEN-containing inteins are naturally occurring mobile genetic elements. The presence of a HEN provides inteins the ability to transfer their coding elements into homologous alleles at homing sites that lack the intein sequence (Fig. 1D). This HEN-mediated homing process can result in horizontal gene transfer (HGT) of inteins, by invasion of diverse species, followed by vertical transmission of inteins (10, 11). Moreover, HEN-containing inteins are involved in a so-called "homing cycle" that includes two opposing processes, precise intein loss and reinvasion of a newly formed vacant homing site. It is believed that the homing cycle allows the HEN to avoid fixation and functional decay in one locus (12).