Initiation of simian virus 40 (SV40) DNA replication is dependent upon the assembly of two T-antigen (T-ag)hexamers on the SV40 core origin. To further define the oligomerization mechanism, the pentanucleotide requirements for T-ag assembly were investigated. Here, we demonstrate that individual pentanucleotides support hexamer formation, while particular pairs of pentanucleotides suffice for the assembly of T-ag double hexamers. Related studies demonstrate that T-ag double hexamers formed on "active pairs" of pentanucleotides catalyze a set of previously described structural distortions within the core origin. For the fourpentanucleotide-containing wild-type SV40 core origin, footprinting experiments indicate that T-ag double hexamers prefer to bind to pentanucleotides 1 and 3. Collectively, these experiments demonstrate that only two of the four pentanucleotides in the core origin are necessary for T-ag assembly and the induction of structural changes in the core origin. Since all four pentanucleotides in the wild-type origin are necessary for extensive DNA unwinding, we concluded that the second pair of pentanucleotides is required at a step subsequent to the initial assembly process.The protein-DNA interactions that take place at eukaryotic origins of DNA replication are poorly characterized. This situation reflects, in part, the failure to identify DNA sequences that constitute higher eukaryotic origins of replication (13). Moreover, there is limited structural information about the initiator proteins that recognize origins of replication (35, 67). Indeed, structural information is currently limited to the origin binding domains of initiators encoded by simian virus 40 (SV40) (45), bovine papillomavirus (33), and Epstein-Barr virus (1, 2).In view of these limitations, a useful model for studies of the protein-DNA interactions that take place at eukaryotic origins is the binding of the virus-encoded T-antigen (T-ag) to the SV40 origin of replication. The well-characterized SV40 core origin is 64 bp long and consists of three separate domains (20,21). The central region, termed site II (or pentanucleotide palindrome), contains four GAGGC pentanucleotides that serve as binding sites for T-ag (24, 69, 70). Site II is flanked by a 17-bp adenine-thymine (AT)-rich domain and the early palindrome (EP) (reviewed in references 4 and 29).T-ag, a 708-amino-acid phosphoprotein, has been extensively studied (reviewed in references 4, 9, and 29). The structure of the T-ag domain that is necessary and sufficient for binding to the SV40 origin, T-ag-obd , was solved by use of nuclear magnetic resonance techniques (45). When the structure was viewed in terms of previous mutagenesis studies of T-ag (65, 76), considerable insight into the mechanism of binding of T-ag to individual pentanucleotides was obtained. For example, it is now apparent that site-specific binding is mediated by a pair of loops (45), a common motif in protein-DNA interactions (9, 41). Additional insights into T-ag binding to individual pentanucleotides, ...