Background: Chromatin looping is exceedingly important to gene regulation and a host of other nuclear processes. Many recent insights into 3D chromatin structure across species and cell types have contributed to our understanding of the principles governing chromatin looping. However, 3D genome evolution and how it relates to Mendelian selection remain largely unexplored. CTCF, an insulator protein found at most loop anchors, has been described as the "master weaver" of mammalian genomes, and variations in CTCF occupancy are known to influence looping divergence. A large fraction of mammalian CTCF binding sites fall within transposable elements (TEs) but their contributions to looping variation are unknown. Here we investigated the effect of TE-driven CTCF binding site expansions on chromatin looping in human and mouse.Results: TEs have broadly contributed to CTCF binding and loop boundary specification, primarily forming variable loops across species and cell types and contributing nearly 1/3 of species-specific and cell-specific loops.
Conclusions:Our results demonstrate that TE activity is a major source of looping variability across species and cell types. Thus, TE-mediated CTCF expansions explain a large fraction of population-level looping variation and may play a role in adaptive evolution.
Keywords"chromatin looping" mouse human "transposable elements" CTCF cohesin evolution Background Ever since chromosomes were first observed microscopically, it has been speculated that their 3D structure plays a central role in regulating nuclear function (1,2). Early observations revealed that individual chromosomes occupy distinct nuclear territories and, while their arrangement varies between different cell 3 types, this structure is conserved between mother and daughter cells (2). These findings led to the hypothesis that chromosome structure directly influences cellular phenotypes. Since that time, microscopic and molecular studies have dissected chromatin structure into an intricate hierarchy of large-scale territories, compartments, domains, neighborhoods, and loops (3-8), confirming the importance of 3D structure in regulating gene expression, replication, and other nuclear processes. However, the mechanisms by which these structures are created and maintained, how they modulate gene expression, and how they evolve are still poorly understood.A common feature of chromatin loops is the presence of insulator proteins at their boundaries, most notably CTCF (3,4,(9)(10)(11)(12)(13)(14). Although this property has been observed across distantly-related metazoan phyla (10), it is especially important in mammals, where CTCF knockdown leads to widespread loop disruption and gene dysregulation (15). There are several known cases of differential gene expression resulting from differential looping (16-21), Chromatin loops often direct enhancers to target genes and differential enhancerpromoter contacts can lead to differential gene expression (22). However, there is no consensus on the underlying mechanisms of this proces...