Genomes encode for genes and the regulatory signals that enable those genes to be transcribed, and are continually shaped by evolution. Genomes, including those of human and yeast, encode for numerous regulatory elements and transcripts that have limited evidence of conservation or function. Here, we sought to create a genomic null hypothesis by quantifying the gene regulatory activity of evolutionarily naïve DNA, using RNA-seq of evolutionarily distant DNA expressed in yeast and computational predictions of random DNA activity in human cells and tissues. In yeast, we found that >99% of bases in naïve DNA expressed as part of one or more transcripts. Naïve transcripts are sometimes spliced, and are similar to evolved transcripts in length and expression distribution, indicating that stable expression and/or splicing are insufficient to indicate adaptation. However, naïve transcripts do not achieve the extreme high expression levels as achieved by evolved genes, and frequently overlap with antisense transcription, suggesting that selection has shaped the yeast transcriptome to achieve high expression and coherent gene structures. In humans, we found that, while random DNA is predicted to have minimal activity, dinucleotide content-matched randomized DNA is predicted to have much of the regulatory activity of evolved sequences, including active chromatin marks at between half (DNase I and H3K4me3) and 1/16th (H3K27ac and H3K4me1) the rate of evolved DNA, and the repression-associated H3K27me3 at about twice the rate of evolved DNA. Naïve human DNA is predicted to be more cell type-specific than evolved DNA and is predicted to generate co-occurring chromatin marks, indicating that these are not reliable indicators of selection. However, extreme high activity is rarely achieved by naïve DNA, consistent with these arising via selection. Our results indicate that evolving regulatory activity from naïve DNA is comparatively easy in both yeast and humans, and we expect to see many biochemically active and cell type-specific DNA sequences in the absence of selection. Such naïve biochemically active sequences have the potential to evolve a function or, if sufficiently detrimental, selection may act to repress them.
The GATA3 gene is essential for T cell differentiation and is surrounded by risk variants for immune traits. Interpretation of these variants is challenging because the regulatory landscape of GATA3 is complex with dozens of potential enhancers spread across a large topological associating domain (TAD) and gene expression quantitative trait locus (eQTL) studies provide limited evidence for variant function. Here, we perform a tiling deletion screen in Jurkat T cells to identify 23 candidate regulatory elements. Using small deletions in primary T helper 2 (Th2) cells, we validate the function of five of these elements, two of which contain risk variants for asthma and allergic diseases. We fine-map genome-wide association study (GWAS) signals in a distal regulatory element, 1 Mb downstream, to identify 14 candidate causal variants. Small deletions spanning candidate rs725861 decrease GATA3 expression in Th2 cells suggesting a causal mechanism for this variant in allergic diseases. Our study demonstrates the power of integrating GWAS signals with deletion mapping and identifies critical regulatory sequences for GATA3.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.