An RNA recognition motif (RRM) of '480 amino acids constitutes the core of RNA-binding domains found in a large family of proteins involved in RNA processing.The Ul RNA-binding domain ofthe A protein component ofthe human Ul small nuclear ribonucleoprotein (RNP), which encompasses the RRM sequence, was analyzed by using NMR spectroscopy. The domain of the A protein is a highly stable monomer in solution consisting of four antiparallel f-strands and two a-helices. The highly conserved RNP1 and RNP2 consensus sequences, containing residues previously suggested to be involved in nucleic acid binding, are juxtaposed in adjacent f-strands. Conserved aromatic side chains that are critical for RNA binding are clustered on the surface of the molecule adjacent to a variable loop that influences recognition of specific RNA sequences. The secondary structure and topology of the RRM are similar to those of ribosomal proteins L12 and L30, suggesting a distant evolutionary relationship between these two types of RNA-associated proteins.Over the past decade, the ways in which proteins interact specifically and nonspecifically with DNA have been revealed by biochemical and biophysical studies (reviewed in refs. 1 and 2). Much of this information has been the result of crystallographic analysis, but more recently, NMR techniques have been used (3). As regards RNA-binding proteins, however, there has been little structural information with the notable exception of the work by Steitz and coworkers (4) on the glutaminyl-tRNA synthetase/tRNAGIn complex.Sequence elements characteristic of a group of RNAassociated proteins began to emerge with the observation of four copies of an 80-amino acid repeat in poly(A)-binding protein (5). The most conserved region of 8 amino acids in these repeats was termed the ribonucleoprotein (RNP) consensus sequence, or octamer (RNP1) (6). Another 6-amino acid region was later termed RNP2 (7). It was subsequently shown by several groups that the sequence similarity appears in many other RNA-associated proteins (7-11) (recently reviewed in ref. 12), contains many conserved positions, and is notably rich in aromatic residues (8). Direct evidence of an interaction between this motif and RNA has led to its designation as an RNA recognition motif, or RRM (8). This motif has also been referred to as an RNP consensus sequence-type RNA binding domain (11) and as an RNP-80 motif (13). Members of the RRM-containing family of proteins include heterogeneous nuclear RNP proteins and U-class small nuclear RNP (snRNP) proteins, as well as poly(A)-binding protein, nucleolin, transcription termination factors La and rho, eukaryotic initiation factor 4B, and sexdetermination proteins in Drosophila. The sources of these proteins range from Escherichia coli to man; thus, the RRM is clearly an ancient protein structure.The RRM can be viewed as analogous to the homeobox domain (5) or to the zinc-binding finger motif (14) of DNAbinding proteins. In many cases, the RRM is a modular component ofa larger protein; other m...