23 24 [All datasets have been deposited to GEO, with accession number GSE94452, and can be 25 accessed here]. 26 3 Abstract: 27 Topologically associating domains (TAD) and insulated neighborhoods (INs) have been 28 proposed to constrain enhancer-promoter communications to enable cell-type specific 29 transcription programs, but recent studies show that disruption of TADs and INs resulted in 30 relatively mild changes in gene expression profiles. To better understand the role of 31 chromatin architecture in dynamic enhancer-promoter contacts and lineage-specific gene 32 expression, we have utilized the auxin-inducible degron system to acutely deplete CTCF, a 33 key factor involved in TADs and IN formation, in mouse embryonic stem cells (mESCs) and 34 examined chromatin architecture and gene regulation during neural differentiation. We find 35 that while CTCF depletion leads to global weakening of TAD boundaries and loss of INs, only 36 a minor fraction of enhancer-promoter contacts are lost, affecting a small subset of genes. 37 The CTCF-dependent enhancer-promoter contacts tend to be long-range, spanning hundreds 38 of kilobases, and are established directly by CTCF binding to promoters. Disruption of CTCF 39 binding at the promoter reduces enhancer-promoter contacts and transcription, while 40 artificial tethering of CTCF to the promoter restores the enhancer-promoter contacts and 41 gene activation. Genome-wide analysis of CTCF binding and gene expression across multiple 42 mouse tissues suggests that CTCF-dependent promoter-enhancer contacts may regulate 43 4 expression of additional mouse genes, particularly those expressed in the brain. Our results 44 uncover both CTCF-dependent and independent enhancer-promoter contacts, and highlight a 45 distinct role for CTCF in promoting enhancer-promoter contacts and gene activation in 46 addition to its insulator function. 47 48 5 Introduction:
49Transcriptional regulation in mammalian cells is orchestrated by cis-regulatory elements that include 50 promoters, enhancers, insulators and other less well characterized sequences 1,2 . Large-scale 51 projects such as ENCODE have annotated millions of candidate cis-regulatory elements in the 52 human genome and genomes of other mammalian species 3-5 . A majority of these candidate 53 regulatory elements are located far from transcription start sites(i.e. promoters), display tissue and 54 cell-type specific chromatin accessibility, and likely act as enhancers to regulate cell-type specific 55 gene expression. Enhancers are frequently found to be positioned close to their target gene 56 promoters in 3D space at the time of gene activation, suggesting a role for the chromatin 57 architecture in gene regulation 6,7 . Indeed, artificially induced spatial proximity between enhancers 58 and promoters has been shown to lead to gene activation 8,9 . Insulators, on the other hand, act to 59 block enhancer-promoter contacts to prevent ectopic gene activation 10-12 . Clearly, in-depth 60 knowledge of the chromatin architecture in each...