The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome.
Chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) has become the dominant technique for mapping transcription factor (TF) binding regions genome-wide. We performed an integrative analysis centered around 457 ChIP-seq data sets on 119 human TFs generated by the ENCODE Consortium. We identified highly enriched sequence motifs in most data sets, revealing new motifs and validating known ones. The motif sites (TF binding sites) are highly conserved evolutionarily and show distinct footprints upon DNase I digestion. We frequently detected secondary motifs in addition to the canonical motifs of the TFs, indicating tethered binding and cobinding between multiple TFs. We observed significant position and orientation preferences between many cobinding TFs. Genes specifically expressed in a cell line are often associated with a greater occurrence of nearby TF binding in that cell line. We observed cell-linespecific secondary motifs that mediate the binding of the histone deacetylase HDAC2 and the enhancer-binding protein EP300. TF binding sites are located in GC-rich, nucleosome-depleted, and DNase I sensitive regions, flanked by wellpositioned nucleosomes, and many of these features show cell type specificity. The GC-richness may be beneficial for regulating TF binding because, when unoccupied by a TF, these regions are occupied by nucleosomes in vivo. We present the results of our analysis in a TF-centric web repository Factorbook (http://factorbook.org) and will continually update this repository as more ENCODE data are generated.
SUMMARY Animal germ cells produce PIWI-interacting RNAs (piRNAs), small silencing RNAs that suppress transposons and enable gamete maturation. Mammalian transposon-silencing piRNAs accumulate early in spermatogenesis, whereas pachytene piRNAs are produced later during post-natal spermatogenesis and account for >95% of all piRNAs in the adult mouse testis. Mutants defective for pachytene piRNA pathway proteins fail to produce mature sperm, but neither the piRNA precursor transcripts nor the trigger for pachytene piRNA production is known. Here, we show that the transcription factor A-MYB initiates pachytene piRNA production. A-MYB drives transcription of both pachytene piRNA precursor RNAs and the mRNAs for core piRNA biogenesis factors, including MIWI, the protein through which pachytene piRNAs function. A-MYB regulation of piRNA pathway proteins and piRNA genes creates a coherent feed-forward loop that ensures the robust accumulation of pachytene piRNAs. This regulatory circuit, which can be detected in rooster testes, likely predates the divergence of birds and mammals.
SUMMARY Numerous chromatin regulators are required for embryonic stem (ES) cell self-renewal and pluripotency, but few have been studied in detail. Here, we examine the roles of several chromatin regulators whose loss affects the pluripotent state of ES cells. We find that Mbd3 and Brg1 antagonistically regulate a common set of genes by regulating promoter nucleosome occupancy. Furthermore, both Mbd3 and Brg1 play key roles in the biology of 5-hydroxymethylcytosine (5hmC): Mbd3 co-localizes with Tet1 and 5hmC in vivo, Mbd3 knockdown preferentially affects expression of 5hmC-marked genes, Mbd3 localization is Tet1-dependent, and Mbd3 preferentially binds to 5hmC relative to 5-methylcytosine in vitro. Finally, both Mbd3 and Brg1 are themselves required for normal levels of 5hmC in vivo. Together, our results identify an effector for 5hmC, and reveal that control of gene expression by antagonistic chromatin regulators is a surprisingly common regulatory strategy in ES cells.
Copy number mutations implicate excess production of α-synuclein as a possibly causative factor in Parkinson’s disease (PD). Using an unbiased screen targeting endogenous gene expression, we discovered that the β2-adrenoreceptor (β2AR) is a regulator of the α-synuclein gene (SNCA). β2AR ligands modulate SNCA transcription through histone 3 lysine 27 acetylation of its promoter and enhancers. Over 11 years of follow-up in 4 million Norwegians, the β2AR agonist salbutamol, a brain-penetrant asthma medication, was associated with reduced risk of developing PD (rate ratio, 0.66; 95% confidence interval, 0.58 to 0.76). Conversely, a β2AR antagonist correlated with increased risk. β2AR activation protected model mice and patient-derived cells. Thus, β2AR is linked to transcription of α-synuclein and risk of PD in a ligand-specific fashion and constitutes a potential target for therapies.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.