DNaseI hypersensitive sites (DHSs) are markers of regulatory DNA and have underpinned the discovery of all classes of cis-regulatory elements including enhancers, promoters, insulators, silencers, and locus control regions. Here we present the first extensive map of human DHSs identified through genome-wide profiling in 125 diverse cell and tissue types. We identify ~2.9 million DHSs that encompass virtually all known experimentally-validated cis-regulatory sequences and expose a vast trove of novel elements, most with highly cell-selective regulation. Annotating these elements using ENCODE data reveals novel relationships between chromatin accessibility, transcription, DNA methylation, and regulatory factor occupancy patterns. We connect ~580,000 distal DHSs with their target promoters, revealing systematic pairing of different classes of distal DHSs and specific promoter types. Patterning of chromatin accessibility at many regulatory regions is choreographed with dozens to hundreds of co-activated elements, and the trans-cellular DNaseI sensitivity pattern at a given region can predict cell type-specific functional behaviors. The DHS landscape shows signatures of recent functional evolutionary constraint. However, the DHS compartment in pluripotent and immortalized cells exhibits higher mutation rates than that in highly differentiated cells, exposing an unexpected link between chromatin accessibility, proliferative potential and patterns of human variation.
The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome.
Epigenome editing with the CRISPR/Cas9 platform is a promising technology to modulate gene expression to direct cell phenotype and to dissect the causal epigenetic mechanisms of gene regulation. Fusions of the nuclease-inactive dCas9 to the KRAB repressor (dCas9-KRAB) can silence target gene expression, but the genome-wide specificity and the extent of heterochromatin formation catalyzed by dCas9-KRAB is not known. We targeted dCas9-KRAB to the HS2 enhancer, a distal regulatory element that orchestrates expression of multiple globin genes. Genome-wide analyses demonstrated that localization of dCas9-KRAB to HS2 specifically induced H3K9 tri-methylation (H3K9me3) at the enhancer and reduced the chromatin accessibility of both the enhancer and its promoter targets. Targeted epigenetic modification of HS2 silenced the expression of multiple globin genes, with minimal off-target changes in gene expression. These results demonstrate that repression mediated by dCas9-KRAB is sufficiently specific to disrupt the activity of individual enhancers via local modification of the epigenome.
The human body contains thousands of unique cell types, each with specialized functions. Cell identity is governed in large part by gene transcription programs, which are determined by regulatory elements encoded in DNA. To identify regulatory elements active in seven cell lines representative of diverse human cell types, we used DNase-seq and FAIRE-seq (Formaldehyde Assisted Isolation of Regulatory Elements) to map ''open chromatin.'' Over 870,000 DNaseI or FAIRE sites, which correspond tightly to nucleosome-depleted regions, were identified across the seven cell lines, covering nearly 9% of the genome. The combination of DNaseI and FAIRE is more effective than either assay alone in identifying likely regulatory elements, as judged by coincidence with transcription factor binding locations determined in the same cells. Open chromatin common to all seven cell types tended to be at or near transcription start sites and to be coincident with CTCF binding sites, while open chromatin sites found in only one cell type were typically located away from transcription start sites and contained DNA motifs recognized by regulators of cell-type identity. We show that open chromatin regions bound by CTCF are potent insulators. We identified clusters of open regulatory elements (COREs) that were physically near each other and whose appearance was coordinated among one or more cell types. Gene expression and RNA Pol II binding data support the hypothesis that COREs control gene activity required for the maintenance of cell-type identity. This publicly available atlas of regulatory elements may prove valuable in identifying noncoding DNA sequence variants that are causally linked to human disease.
INTRODUCTIONIdentification of active gene regulatory elements is a key to understanding transcriptional control governing biological processes such as cell-type specificity, differentiation, development, proliferation, and response to the environment. Mapping DNase I hypersensitive (HS) sites has historically been a valuable tool for identifying all different types of regulatory elements, including promoters, enhancers, silencers, insulators, and locus control regions. This method utilizes DNase I to selectively digest nucleosome-depleted DNA (presumably by transcription factors), whereas DNA regions tightly wrapped in nucleosome and higher-order structures are more resistant. The traditional low-throughput method for identifying DNase I HS sites uses Southern blots. Here, we describe the complete and improved protocol for DNase-seq, a high-throughput method that identifies DNase I HS sites across the whole genome by capturing DNase-digested fragments and sequencing them by high-throughput, next-generation sequencing. In a single experiment, DNase-seq can identify most active regulatory regions from potentially any cell type, from any species with a sequenced genome.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.