The majority of mammalian genomes are devoted to transposable elements (TEs). Whilst TEs are increasingly recognized for their important biological functions, they are a potential danger to genomic stability and are carefully regulated by the epigenetic system. However, the full complexity of this regulatory system is not understood. Here, using mouse embryonic stem cells, we show that TEs are suppressed by heterochromatic marks like H3K9me3, and are also labelled by all major types of chromatin modification in complex patterns, including bivalent activatory and repressive marks. We identified 29 epigenetic modifiers that significantly deregulated at least one type of TE. The loss of Setdb1, Ncor2, Rnf2, Kat5, Prmt5, Uhrf1, and Rrp8 caused widespread changes in TE expression and chromatin accessibility. These effects were context-specific, with different chromatin modifiers regulating the expression and chromatin accessibility of specific subsets of TEs. Our work reveals the complex patterns of epigenetic regulation of TEs.
The current classification of cells in an organism is largely based on their anatomic and developmental origin. Cells types and tissues are traditionally classified into those that arise from the three embryonic germ layers, the ectoderm, mesoderm and endoderm, but this model does not take into account the organization of cell type-specific patterns of gene expression. Here, we present computational models for cell type and tissue specification derived from a collection of 921 RNA-sequencing samples from 272 distinct mouse cell types or tissues. In an unbiased fashion, this analysis accurately predicts the three known germ layers. Unexpectedly, this analysis also suggests that in total there are eight major domains of cell type-specification, corresponding to the neurectoderm, neural crest, surface ectoderm, endoderm, mesoderm, blood mesoderm, germ cells and the embryonic domain. Further, we identify putative genes responsible for specifying the domain and the cell type. This model has implications for understanding trans-lineage differentiation for stem cells, developmental cell biology and regenerative medicine.
Somatic cell reprogramming by exogenous factors requires cooperation with transcriptional co-activators and co-repressors to effectively remodel the epigenetic environment. How this interplay is regulated remains poorly understood. Here, we demonstrate that NCoR/SMRT co-repressors bind to pluripotency loci to create a barrier to reprogramming with the four Yamanaka factors (OCT4, SOX2, KLF4 and c-MYC), and consequently, suppressing NCoR/SMRT significantly enhances reprogramming efficiency and kinetics. The core epigenetic subunit of the NCoR/SMRT complex, histone deacetylase 3 (HDAC3), contributes to the effects of NCoR/SMRT by inducing histone deacetylation at pluripotency loci. Among the Yamanaka factors, recruitment of NCoR/SMRT-HDAC3 to genomic loci is mostly facilitated by c-MYC. Hence, we describe how c-MYC is beneficial for the early phase of reprogramming but deleterious later. Overall, we uncover a role for NCoR/SMRT co-repressors in reprogramming and propose a dual function for c-MYC in this process.
Transposable elements (TEs) occupy nearly 40% of mammalian genomes and, whilst most are fragmentary and no longer capable of transposition, they can nevertheless contribute to cell function. TEs within genes transcribed by RNA polymerase II can be copied as parts of primary transcripts; however, their full contribution to mature transcript sequences remains unresolved. Here, using long and short read (LR and SR) RNA sequencing data, we show that 26% of coding and 65% of noncoding transcripts in human pluripotent stem cells (hPSCs) contain TE-derived sequences. Different TE families are incorporated into RNAs in unique patterns, with consequences to transcript structure and function. The presence of TE sequences within a transcript is correlated with TE-type specific changes in its subcellular distribution, alterations in steady-state levels and half-life, and differential association with RNA Binding Proteins (RBPs). We identify hPSC-specific incorporation of endogenous retroviruses (ERVs) and LINE:L1 into protein-coding mRNAs, which generate TE sequence-derived peptides. Finally, single cell RNA-seq reveals that hPSCs express ERV-containing transcripts, whilst differentiating subpopulations lack ERVs and express SINE and LINE-containing transcripts. Overall, our comprehensive analysis demonstrates that the incorporation of TE sequences into the RNAs of hPSCs is more widespread and has a greater impact than previously appreciated.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.