Unequivocal identification of the full composition of a gene is made difficult by the cryptic nature of regulatory elements. Regulatory elements are notoriously difficult to locate and may reside at considerable distances from the transcription units on which they operate and, moreover, may be incorporated into the structure of neighbouring genes. The importance of regulatory mutations as the basis of human abnormalities remains obscure. Here, we show that the chromosome 7q36 associated preaxial polydactyly, a frequently observed congenital limb malformation, results from point mutations in a Shh regulatory element. Shh, normally expressed in the ZPA posteriorly in the limb bud, is expressed in an additional ectopic site at the anterior margin in mouse models of PPD. Our investigations into the basis of the ectopic Shh expression identified the enhancer element that drives normal Shh expression in the ZPA. The regulator, designated ZRS, lies within intron 5 of the Lmbr1 gene 1 Mb from the target gene Shh. The ZRS drives the early spatio-temporal expression pattern in the limb of tetrapods. Despite the morphological differences between limbs and fins, an equivalent regulatory element is found in fish. The ZRS contains point mutations that segregate with polydactyly in four unrelated families with PPD and in the Hx mouse mutant. Thus point mutations residing in long-range regulatory elements are capable of causing congenital abnormalities, and possess the capacity to modify gene activity such that a novel gamut of abnormalities is detected.
In addition to protein coding sequence, the human genome contains a significant amount of regulatory DNA, the identification of which is proving somewhat recalcitrant to both in silico and functional methods. An approach that has been used with some success is comparative sequence analysis, whereby equivalent genomic regions from different organisms are compared in order to identify both similarities and differences. In general, similarities in sequence between highly divergent organisms imply functional constraint. We have used a whole-genome comparison between humans and the pufferfish, Fugu rubripes, to identify nearly 1,400 highly conserved non-coding sequences. Given the evolutionary divergence between these species, it is likely that these sequences are found in, and furthermore are essential to, all vertebrates. Most, and possibly all, of these sequences are located in and around genes that act as developmental regulators. Some of these sequences are over 90% identical across more than 500 bases, being more highly conserved than coding sequence between these two species. Despite this, we cannot find any similar sequences in invertebrate genomes. In order to begin to functionally test this set of sequences, we have used a rapid in vivo assay system using zebrafish embryos that allows tissue-specific enhancer activity to be identified. Functional data is presented for highly conserved non-coding sequences associated with four unrelated developmental regulators (SOX21, PAX6, HLXB9, and SHH), in order to demonstrate the suitability of this screen to a wide range of genes and expression patterns. Of 25 sequence elements tested around these four genes, 23 show significant enhancer activity in one or more tissues. We have identified a set of non-coding sequences that are highly conserved throughout vertebrates. They are found in clusters across the human genome, principally around genes that are implicated in the regulation of development, including many transcription factors. These highly conserved non-coding sequences are likely to form part of the genomic circuitry that uniquely defines vertebrate development.
SummaryMetazoan development involves the successive activation and silencing of specific gene expression programs and is driven by tissue-specific transcription factors programming the chromatin landscape. To understand how this process executes an entire developmental pathway, we generated global gene expression, chromatin accessibility, histone modification, and transcription factor binding data from purified embryonic stem cell-derived cells representing six sequential stages of hematopoietic specification and differentiation. Our data reveal the nature of regulatory elements driving differential gene expression and inform how transcription factor binding impacts on promoter activity. We present a dynamic core regulatory network model for hematopoietic specification and demonstrate its utility for the design of reprogramming experiments. Functional studies motivated by our genome-wide data uncovered a stage-specific role for TEAD/YAP factors in mammalian hematopoietic specification. Our study presents a powerful resource for studying hematopoiesis and demonstrates how such data advance our understanding of mammalian development.
Aberrant transcriptional programs in combination with abnormal proliferative signaling drive leukemic transformation. These programs operate in normal hematopoiesis where they are involved in hematopoietic stem cell (HSC) proliferation and maintenance. Ets Related Gene (ERG) is a component of normal and leukemic stem cell signatures and high ERG expression is a risk factor for poor prognosis in acute myeloid leukemia (AML). However, mechanisms that underlie ERG expression in AML and how its expression relates to leukemic stemness are unknown. We report that ERG expression in AML is associated with activity of the ERG promoters and +85 stem cell enhancer and a heptad of transcription factors that combinatorially regulate genes in HSCs. Gene expression signatures derived from ERG promoter-stem cell enhancer and heptad activity are associated with clinical outcome when ERG expression alone fails. We also show that the heptad signature is associated with AMLs that lack somatic mutations in NPM1 and confers an adverse prognosis when associated with FLT3 mutations. Taken together, these results suggest that transcriptional regulators cooperate to establish or maintain primitive stem cell-like signatures in leukemic cells and that the underlying pattern of somatic mutations contributes to the development of these signatures and modulate their influence on clinical outcome.
Fish-mammal genomic comparisons have proved powerful in identifying conserved noncoding elements likely to be cis-regulatory in nature, and the majority of those tested in vivo have been shown to act as tissue-specific enhancers associated with genes involved in transcriptional regulation of development. Although most of these elements share little sequence identity to each other, a small number are remarkably similar and appear to be the product of duplication events. Here, we searched for duplicated conserved noncoding elements in the human genome, using comparisons with Fugu to select putative cis-regulatory sequences. We identified 124 families of duplicated elements, each containing between two and five members, that are highly conserved within and between vertebrate genomes. In 74% of cases, we were able to assign a specific set of paralogous genes with annotation relating to transcriptional regulation and/or development to each family, thus removing much of the ambiguity in identifying associated genes. We find that duplicate elements have the potential to up-regulate reporter gene expression in a tissue-specific manner and that expression domains often overlap, but are not necessarily identical, between family members. Over two thirds of the families are conserved in duplicate in fish and appear to predate the large-scale duplication events thought to have occurred at the origin of vertebrates. We propose a model whereby gene duplication and the evolution of cis-regulatory elements can be considered in the context of increased morphological diversity and the emergence of the modern vertebrate body plan.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.