Genomic and two novel cDNA clones for rice seed allergenic protein (RA) belonging to the alpha-amylase/trypsin inhibitor family were isolated and their nucleotide sequences determined. Ten cysteine residues deduced from nucleotide sequences were completely conserved among three cDNA clones including a clone, RA17, reported previously. One genomic clone, lambda 4, contained two RA genes, RAG1 and RAG2. Although RAG1 was cloned at the 5' portion only, two RA genes were arranged divergently. Nucleotide sequencing and DNA blotting analyses showed that RA are encoded by a multigene family consisting of at least four members. The transcriptional initiation site of RAG1 was localized at A, 26 bp upstream of the putative translational initiation codon, ATG, by the primer extension assay. The putative TATA box and CAAT box existed about 45 bp and 147 bp upstream of the transcription initiation site, respectively. A conserved sequence (ATGCAAAA) which was similar to the sequence (TGCAAAA) identified in rice glutelin promoters was observed in the 5' region of the two genes. In addition, RNA blotting analyses provided that RA genes specifically expressed in ripening seed and their transcripts accumulated maximally between 15 and 20 days after flowering.