HOT (highly occupied target) regions bound by many transcription factors are considered to be one of the most intriguing findings of the recent modENCODE reports, yet their functions have remained unclear. We tested 108 Drosophila melanogaster HOT regions in transgenic embryos with site-specifically integrated transcriptional reporters. In contrast to prior expectations, we found 102 (94%) to be active enhancers during embryogenesis and to display diverse spatial and temporal patterns, reminiscent of expression patterns for important developmental genes. Remarkably, HOT regions strongly activate nearby genes and are required for endogenous gene expression, as we show using bacterial artificial chromosome (BAC) transgenesis. HOT enhancers have a distinct cis-regulatory signature with enriched sequence motifs for the global activators Vielfaltig, also known as Zelda, and Trithoraxlike, also known as GAGA. This signature allows the prediction of HOT versus control regions from the DNA sequence alone. Here, we show that Drosophila HOT regions function as transcriptional enhancers with diverse activity patterns. While a large number of bound TFs is the defining feature of HOT regions, many TFs seem to be bound neutrally without any apparent contribution to enhancer activity. HOT enhancers are characterized by a distinct and predictive cis-regulatory signature, which includes motifs for Vielfaltig/Zelda (ZLD), a recently reported activator of the early Drosophila genome (Liang et al. 2008;Harrison et al. 2011;Nien et al. 2011), and Trithoraxlike/GAGA (GAGA), a TF known to form homomeric and heteromeric complexes (Bardwell and Treisman 1994) and to be required for the generation and maintenance of nucleosome-free regions (Croston et al. 1991;Nakayama et al. 2007).
Results and Discussion
HOT regions function as early embryonic enhancers with diverse patternsWe tested a representative set of 108 D. melanogaster HOT regions (see the Materials and Methods; Supplemental Tables 1, 2) in transgenic embryos with sitespecifically integrated transcriptional reporters (Fig. 1A). Strikingly, 94% (102) of these HOT regions drove reporter expression in a specific pattern during embryogenesis. In contrast, only 39% of control regions (16 of 41) functioned as enhancers, including 11 of 21 regions chosen to contain TF-binding sites and five of 20 regions chosen to contain no known binding sites (see the Materials and Methods; Supplemental Fig. S1; Supplemental Table 1). This enrichment of HOT over control regions is highest at stages 3-10 (corresponding to 1-5 h after fertilization) (Fig. 1B), at which most of the chromatin immunoprecipitation (ChIP) experiments leading to the definition of the HOT regions had been performed (MacArthur et al. 2009; The modENCODE Consortium 2010).Contrary to expectations that HOT enhancers might constitute a particular class of enhancers (for example, with ubiquitous activity), they display highly diverse spatial activity patterns in all major presumptive tissues of the blastoderm embryo ( Fig. 1C; Sup...