A protein that binds specifically to Saccharomyces cerevisiae centromere DNA element I was purified on the basis of a nitrocellulose filter-binding assay. This protein, termed centromere-binding protein 1 (CP1), was heat stable and renaturable from sodium dodecyl sulfate (SDS), and assays of eluates from SDS gels indicated a molecular weight of 57,000 to 64,000. An activity with similar specificity and stability was detected in human lymphocyte extracts, and analysis in SDS gels revealed a molecular weight of 39,000 to 49,000. CP1-binding sites occurred not only at centromeres but also near many transcription units, for example, adjacent to binding sites for the GAL4-positive regulatory protein upstream of the GAL2 gene in S. cerevisiae and adjacent to the TATA element of the adenovirus major late promoter. A factor (termed USF) that binds to the latter site and stimulates transcription has been isolated from HeLa cells by others.Centromeres are specialized regions of eucaryotic chromosomes that form sites of attachment for spindle microtubules in mitosis and meiosis (13,35,36). Little is known of the mechanism of spindle attachment or the molecules involved. We report here on the isolation of a centromere DNA-binding protein from Saccharomyces cerevisiae and its counterpart from human cells.Cloning and deletion analysis of yeast centromeres revealed three contiguous conserved sequences essential to or important for centromere function: centromere DNA element I (CDEI), an 8-base-pair (bp) conserved sequence; CDEII, an apparently random, adenine-plus-thymine (A + T)-rich stretch of approximately 90 bp; and CDEIII, a 25-bp conserved sequence (11,12,17,18,20,21,24,29,32,33,39). Nuclease digestion studies of chromatin containing centromere regions CEN3 and CENII revealed an unusual configuration: 250 bp, including the three conserved DNA elements, are especially protected from digestion (4). Crude fractions from S. cerevisiae bind naked DNA containing centromere sequences (5), but no resolution of the proteins involved has been described.A centromere-specific DNA-binding activity was discovered during recent studies of the GAL4-positive regulatory protein (7
MATERIALS AND METHODSPlasmid DNAs. Plasmid pG2p contains the 5' half of the GAL2 gene (7). Plasmids p181, p113, p200, p201, p203, p202, and p182 containing HindlIl fragments of CENI, CEN4, CEN7, CENII, CEN14, CEN15, and 2,um-STB in YRpJ41ARS1 were kindly provided by P. Hieter (20).Yeast strains and media. BY2 (6) is a protease-deficient strain (pep4-3) carrying the GAL4 gene on a high-copynumber plasmid. YNN 267 (a ura3-52 Ahis3-200 ade2-101 lys2-801 met-Agal4-537) was a gift from M. Johnston. BJ926 (a trpl prcl -126 pep4-3 prbl -1122 can1la his] prcl -126 pep4-3 prbl-1122 can]) was kindly provided by E. Jones. Cells were grown in YP medium (1% yeast extract, 2%Bacto-Peptone) containing 2% glucose to an A6w of 5.Nitrocellulose filter-binding assay and DNase I footprints. Plasmid pG2p DNA was linearized by cleavage with EcoRI, and the recessed 3' ends were fille...