BackgroundDNA methylation (5mC) plays important roles in epigenetic regulation of genome function. Recently, TET hydroxylases have been found to oxidise 5mC to hydroxymethylcytosine (5hmC), formylcytosine (5fC) and carboxylcytosine (5caC) in DNA. These derivatives have a role in demethylation of DNA but in addition may have epigenetic signaling functions in their own right. A recent study identified proteins which showed preferential binding to 5-methylcytosine (5mC) and its oxidised forms, where readers for 5mC and 5hmC showed little overlap, and proteins bound to further oxidation forms were enriched for repair proteins and transcription regulators. We extend this study by using promoter sequences as baits and compare protein binding patterns to unmodified or modified cytosine using DNA from mouse embryonic stem cell extracts.ResultsWe compared protein enrichments from two DNA probes with different CpG composition and show that, whereas some of the enriched proteins show specificity to cytosine modifications, others are selective for both modification and target sequences. Only a few proteins were identified with a preference for 5hmC (such as RPL26, PRP8 and the DNA mismatch repair protein MHS6), but proteins with a strong preference for 5fC were more numerous, including transcriptional regulators (FOXK1, FOXK2, FOXP1, FOXP4 and FOXI3), DNA repair factors (TDG and MPG) and chromatin regulators (EHMT1, L3MBTL2 and all components of the NuRD complex).Conclusions0ur screen has identified novel proteins that bind to 5fC in genomic sequences with different CpG composition and suggests they regulate transcription and chromatin, hence opening up functional investigations of 5fC readers.
SP1 is a ubiquitous transcription factor that is involved in the regulation of various house-keeping genes. It is known that it acts by binding to a double-stranded consensus motif. Here, we have discovered that SP1 binds also to a non-canonical DNA structure, a G-quadruplex, with high affinity. In particular, we have studied the SP1 binding site within the promoter region of the c-KIT oncogene and found that this site can fold into an anti-parallel two-tetrad G-quadruplex. SP1 pull-down experiments from cellular extracts, together with biophysical binding assays revealed that SP1 has a comparable binding affinity for this G-quadruplex structure and the canonical SP1 duplex sequence. Using SP1 ChIP-on-chip data sets, we have also found that 87% of SP1 binding sites overlap with G-quadruplex forming sequences. Furthermore, while many of these immuoprecipitated sequences (36%) even lack the minimal SP1 consensus motif, 5′-GGGCGG-3′, we have shown that 77% of them are putative G-quadruplexes. Collectively, these data suggest that SP1 is able to bind both, canonical SP1 duplex DNA as well as G-quadruplex structures in vitro and we hypothesize that both types of interactions may occur in cells.
The modified base 5-formylcytosine (5fC) was recently identified in mammalian DNA and might be considered as the “seventh” base of the genome. This nucleotide has been implicated in active demethylation mediated by the base excision repair enzyme thymine DNA glycosylase (TDG). Genomics and proteomics studies have suggested a further role for 5fC in transcription regulation through chromatin remodeling. Herein we propose how 5fC might signal these processes through its effect on DNA conformation. Biophysical and structural analysis revealed that 5fC alters the structure of the DNA double helix leading to a conformation unique amongst known DNA structures including those comprising other cytosine modifications. The 1.4 Å resolution X-ray crystal structure of a DNA dodecamer comprising three 5fCpG sites shown how 5fC changes the geometry of the grooves and base pairs associated with the modified base, which lead to helical under-winding.
It all clicks into place: A potent telomere‐targeting small molecule has been identified by using the copper‐free 1,3‐dipolar cycloaddition of a series of alkyne and azide building blocks catalyzed by a non‐Watson–Crick DNA secondary structure (see picture). This method rapidly identifies, otherwise unanticipated, potent small‐molecule probes to selectively target a given RNA or DNA.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.