Using an interaction blot approach to search in the human nuclear proteome, we identified eight novel proteins that bind the hyperphosphorylated C-terminal repeat domain (phosphoCTD) of RNA polymerase II. Unexpectedly, five of the new phosphoCTD-associating proteins (PCAPs) represent either enzymes that act on DNA and chromatin (topoisomerase I, DNA (cytosine-5) methyltransferase 1, poly(ADP-ribose) polymerase-1) or proteins known to bind DNA (heterogeneous nuclear ribonucleoprotein (hnRNP) U/SAF-A, hnRNP D). The other three PCAPs represent factors involved in pre-mRNA metabolism as anticipated (CA150, NSAP1/hnRNP Q, hnRNP R) (note that hnRNP U/SAF-A and hnRNP D are also implicated in pre-mRNA metabolism). Identifying as PCAPs proteins involved in diverse DNA transactions suggests that the range of phosphoCTD functions extends far beyond just transcription and RNA processing. In view of the activities possessed by the DNA-directed PCAPs, it is likely that the phosphoCTD plays important roles in genome integrity, epigenetic regulation, and potentially nuclear structure. We present a model in which the phosphoCTD association of the PCAPs poises them to act either on the nascent transcript or on the DNA/chromatin template. We propose that the phosphoCTD of elongating RNA polymerase II is a major organizer of nuclear functions.
Molecular & Cellular Proteomics 1:598 -610, 2002.The C-terminal repeat domain (CTD), 1 a non-catalytic "tail" on the largest subunit of RNA polymerase, is a unique structure made up of as many as 52 repeats of the consensus heptamer YSPTSPS (1). That each seven-residue repeat contains five hydroxyl-containing side chains suggests that the CTD could be a good target for phosphorylation, and indeed, in elongating RNAP II the CTD is hyperphosphorylated (2-4); in preinitiating RNAP II, on the other hand, the CTD is not phosphorylated (e.g. see Ref. 5). The amino acid composition of the CTD also renders it very hydrophilic, and it is largely unstructured in aqueous solution (6, 7); in mammals a fully stretched out CTD could potentially extend some 1200 Å away from the globular body of the polymerase (whose diameter is ϳ150 Å, Ref. 8) (see e.g. Ref. 9). Its unusual properties render the CTD well suited to function as an extended scaffold to which transcription factors and other proteins could bind; indeed, a growing body of data indicates that such a scaffolding role is the principal function of the CTD. Experiments focusing on the hyperphosphorylated CTD characteristic of elongating RNAP II have revealed that a number of RNA processing factors bind to the phosphoCTD (e.g. see Refs. 10 and 11). Binding to the phosphoCTD tethers such factors to the transcription machinery where they can carry out their functions more efficiently (e.g. see Refs. 12-16); furthermore, certain factors or processes are stimulated by phosphoCTD binding (e.g. see . In addition, a pool of nontranscribing RNAP II carries a phosphoCTD and is associated with certain transcription and processing factors in potential assembly ar...