Protein-nucleic acid interactions are responsible for the regulation of key biological events such as genomic transcription and recombination and viral replication. However, the recognition mechanisms involved in these processes are not completely understood. Here, we investigate the dominant forces involved in protein-protein and protein-DNA interactions for the 80-amino-acid C-terminal domain of the E2 protein (E2c) from human papillomavirus (HPV-16). The E2c protein is a homodimer that specifically binds to double-stranded DNA containing the consensus sequence ACCG-N 4 -CGGT, where N is any nucleotide. DNA binding affinity is reduced by lowering water chemical potential, accompanied by an increase in cooperativity. Wyman linkage relations between affinity and water chemical potential indicate that 11 additional water molecules are bound in the formation of the complex between E2c and DNA. Salt dissociation isotherms showed that 10 counterions are released upon association, even at low water activity, indicating that this latter variable does not change the electrostatic component of the interaction. Further analysis demonstrates a strong dependence of cooperativity of binding on the protein concentration. Altogether, these results reveal a novel binding pathway in which the consolidated complex may achieve its final form via a monomer-DNA intermediate, which favors the binding of a second monomer. This molecular mechanism reveals the contributions of multiple conformers in a tight virus genome modulation that seems to be important in the cell infection scenario.
Human papillomavirus (HPV)1 is a small nonenveloped double-stranded DNA virus that causes epithelial lesions, which often develop into malignancies (1). The viruses HPV 16, -18, and -31 are denoted high risk HPV since they are directly related to cervical cancer. The regulation of gene expression in virus-infected cells is mediated by the factor E2, which controls the transcription of the HPV gene, including those related to malignant transformation. E2 can act either as an activator or as a repressor, depending on the alternate splicing of the product. Infected cells can be found containing (i) the C-terminal domain (E2c), responsible for DNA binding (2) and dimerization (3); (ii) the E2-E8 splicing product; and (iii) the full E2 protein (E2-full), consisting of the E2c, the N-terminal transactivation domain (E2TA), and an unstructured region with little known function. These three components recognize and bind specifically the palindromic sequence ACCGN 4 CGGT (2), where N is any base, although differences in affinity can arise depending on the N bases (4 -7). Part of the E2 structure has been solved: both for E2c, the DNA-binding domain (7-10), and for the transactivation domain (11,12).Much has appeared in the literature about the behavior of the E2c protein from the high risk HPV-16, in terms of homoassociation (13-16) and DNA binding and recognition, through equilibrium (6, 9, 14, 16 -20) and kinetic assays (4, 21). Although we have previously p...