The gene for human apolipoprotein E (apo-E) was selected from a library of cloned genomic DNA by screening with a specific cDNA hybridization probe, and its structure was characterized. The complete nucleotide sequence of the gene as well as 856 nucleotides of the 5' flanking region and 629 nucleotides of the 3' flanking region were determined. Analysis of the sequence showed that the mRNA-encoding region of the apo-E gene consists of four exons separated by three introns. In comparison to the structure of the mRNA, the introns are located in the 5' noncoding region, in the codon for glycine at position -4 of the signal peptide region, and in the codon for arginine at position +61 of the mature protein. The overall lengths of the apo-E gene and its corresponding mRNA are 3597 and 1163 nucleotides, respectively; a mature plasma protein of 299 amino acids is produced by this gene. Examination of the 5' terminus of the gene by S1 nuclease mapping shows apparent multiple transcription initiation sites. The proximal 5' flanking region contains a "TATA box" element as well as two nearby inverted repeat elements. In addition, there are four Alu family sequences associated with the apo-E gene: an Alu sequence located near each end of the gene and two Alu sequences located in the second intron. This knowledge of the structure permits a molecular approach to characterizing the regulation of the apo-E gene.Apolipoprotein E (apo-E) is a component of various classes of plasma lipoproteins in all mammals that have been studied (for review, see refs. 1 and 2). It is a single chain polypeptide (Mr, 34,000) of 299 amino acids (3) that is synthesized initially with an 18-residue signal peptide that is removed cotranslationally (4,5). The amino acid sequence as well as the mRNA nucleotide sequence are known for both the human (3, 6) and rat (7) species. The major site of synthesis is the liver, but relatively abundant levels of apo-E mRNA have been detected in many extrahepatic tissues, including the brain and the adrenals (8). In addition, apo-E is produced by mouse peritoneal macrophages, as well as human monocytederived macrophages (9).A major function of apo-E is its mediation of the cellular uptake of specific lipoproteins through an interaction with apo-B,E(LDL) receptors on extrahepatic and hepatic cell surfaces and with distinct hepatic apo-E receptors (for review, see ref. 10). The receptor binding domain of human apo-E has been determined to be an arginine-and lysine-rich region in the vicinity of residues 140 and 160 (11,12). Variant forms of apo-E with single amino acid substitutions in this region show decreased receptor binding activity (13)(14)(15) and are associated with type III hyperlipoproteinemia and accelerated cardiovascular disease (for review, see refs. 16 and 17). Apolipoprotein E with normal receptor binding activity is found in two common isoforms, the E3 and E4 phenotypes, with either cysteine or arginine, respectively, at residue position 112 (13).Because of the central role that apo-E plays i...