A genomic clone encoding the Drosophila Ul small nuclear ribonucleoprotein particle 70K protein was isolated by hybridization with a human Ul small nuclear ribonucleoprotein particle 70K protein cDNA. Southern blot and in situ hybridizations showed that this Ul 70K gene is unique in the Drosophila genome, residing at cytological position 27D1,2. Polyadenylated transcripts of 1.9 and 3.1 kilobases were observed. While the 1.9-kilobase mRNA is always more abundant, the ratio of these two transcripts is developmentally regulated. Analysis of cDNA and genomic sequences indicated that these two RNAs encode an identical protein with a predicted molecular weight of 52,879. Comparison of the Ul 70K proteins predicted from Drosophila, human, and Xenopus cDNAs revealed 68% amino acid identity in the most amino-terminal 214 amino acids, which include a sequence motif common to many proteins which bind RNA. The carboxy-terminal half is less well conserved but is highly charged and contains distinctive arginine-rich regions in all three species. These arginine-rich regions contain stretches of arginine-serine dipeptides like those found in transformer, transformer-2, and suppressor-of-white-apricot proteins, all of which have been identified as regulators of mRNA splicing in Drosophila melanogaster.Splicing of pre-mRNA occurs in a two-step reaction requiring the assembly of a large complex (the spliceosome) which includes several small nuclear ribonucleoprotein particles (snRNPs) (see references 19, 32, and 52 for reviews). Each snRNP consists of at least one snRNA bound to several proteins, some of which are common to all the spliceosomal snRNPs. The human Ul snRNP is composed of a 164-nucleotide (nt) Ul snRNA molecule complexed to at least 10 proteins (30). It binds to 5' splice site sequences by base pairs involving the 5' end of Ul RNA (12,24,35,62). There is also evidence that the Ul snRNP interacts with the U2 snRNP during splicing (8,12), and that this interaction occurs early and is required for further stages of splicing (25,48,50). Three of the human Ul snRNP proteins, 70K, A, and C, are unique to the Ul snRNP (21, 43); the other Ul snRNP proteins are common to the U2, U5, and U4/U6 snRNPs.Patients with autoimmune disorders produce antibodies which specifically immunoprecipitate the Ul snRNP (28), and many of these are directed against epitopes on the 70K protein (43, 60). Anti-RNP antisera have been shown to block in vitro splicing when they are added to splicing reactions (40). Such sera, as well as a mouse monoclonal antibody to the mouse Ul snRNP 70K protein (6), have enabled researchers to clone cDNAs encoding the Ul 70K protein from expression libraries (47,59).Sequence data from these cDNAs have provided information regarding the primary structure of the Ul 70K protein.First, it appears that this protein is actually 52 kilodaltons in size (45, 53). We have followed others (15, 46) in using Ul 70K or 70K as a name for this protein, although this name says nothing about the size of the protein. A conserved reg...