The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but a similar reference has been lacking for epigenomic studies. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection to date of human epigenomes for primary cells and tissues. Here, we describe the integrative analysis of 111 reference human epigenomes generated as part of the program, profiled for histone modification patterns, DNA accessibility, DNA methylation, and RNA expression. We establish global maps of regulatory elements and define regulatory modules of coordinated activity, along with their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation, and human disease.
DNA methylation is a crucial element in the epigenetic regulation of mammalian embryonic development. However, its dynamic patterns have not been analysed at the genome scale in human pre-implantation embryos due to technical difficulties and the scarcity of required materials. Here we systematically profile the methylome of human early embryos from the zygotic stage through to post-implantation by reduced representation bisulphite sequencing and whole-genome bisulphite sequencing. We show that the major wave of genome-wide demethylation is complete at the 2-cell stage, contrary to previous observations in mice. Moreover, the demethylation of the paternal genome is much faster than that of the maternal genome, and by the end of the zygotic stage the genome-wide methylation level in male pronuclei is already lower than that in female pronuclei. The inverse correlation between promoter methylation and gene expression gradually strengthens during early embryonic development, reaching its peak at the post-implantation stage. Furthermore, we show that active genes, with the trimethylation of histone H3 at lysine 4 (H3K4me3) mark at the promoter regions in pluripotent human embryonic stem cells, are essentially devoid of DNA methylation in both mature gametes and throughout pre-implantation development. Finally, we also show that long interspersed nuclear elements or short interspersed nuclear elements that are evolutionarily young are demethylated to a milder extent compared to older elements in the same family and have higher abundance of transcripts, indicating that early embryos tend to retain higher residual methylation at the evolutionarily younger and more active transposable elements. Our work provides insights into the critical features of the methylome of human early embryos, as well as its functional relation to the regulation of gene expression and the repression of transposable elements.
The Collaborative Cross Consortium reports here on the development of a unique genetic resource population. The Collaborative Cross (CC) is a multiparental recombinant inbred panel derived from eight laboratory mouse inbred strains. Breeding of the CC lines was initiated at multiple international sites using mice from The Jackson Laboratory. Currently, this innovative project is breeding independent CC lines at the University of North Carolina (UNC), at Tel Aviv University (TAU), and at Geniad in Western Australia (GND). These institutions aim to make publicly available the completed CC lines and their genotypes and sequence information. We genotyped, and report here, results from 458 extant lines from UNC, TAU, and GND using a custom genotyping array with 7500 SNPs designed to be maximally informative in the CC and used a novel algorithm to infer inherited haplotypes directly from hybridization intensity patterns. We identified lines with breeding errors and cousin lines generated by splitting incipient lines into two or more cousin lines at early generations of inbreeding. We then characterized the genome architecture of 350 genetically independent CC lines. Results showed that founder haplotypes are inherited at the expected frequency, although we also consistently observed highly significant transmission ratio distortion at specific loci across all three populations. On chromosome 2, there is significant overrepresentation of WSB/EiJ alleles, and on chromosome X, there is a large deficit of CC lines with CAST/EiJ alleles. Linkage disequilibrium decays as expected and we saw no evidence of gametic disequilibrium in the CC population as a whole or in random subsets of the population. Gametic equilibrium in the CC population is in marked contrast to the gametic disequilibrium present in a large panel of classical inbred strains. Finally, we discuss access to the CC population and to the associated raw data describing the genetic structure of individual lines. Integration of rich phenotypic and genomic data over time and across a wide variety of fields will be vital to delivering on one of the key attributes of the CC, a common genetic reference platform for identifying causative variants and genetic networks determining traits in mammals.
Oleaginous microalgae are promising feedstock for biofuels, yet the genetic diversity, origin and evolution of oleaginous traits remain largely unknown. Here we present a detailed phylogenomic analysis of five oleaginous Nannochloropsis species (a total of six strains) and one time-series transcriptome dataset for triacylglycerol (TAG) synthesis on one representative strain. Despite small genome sizes, high coding potential and relative paucity of mobile elements, the genomes feature small cores of ca. 2,700 protein-coding genes and a large pan-genome of >38,000 genes. The six genomes share key oleaginous traits, such as the enrichment of selected lipid biosynthesis genes and certain glycoside hydrolase genes that potentially shift carbon flux from chrysolaminaran to TAG synthesis. The eleven type II diacylglycerol acyltransferase genes (DGAT-2) in every strain, each expressed during TAG synthesis, likely originated from three ancient genomes, including the secondary endosymbiosis host and the engulfed green and red algae. Horizontal gene transfers were inferred in most lipid synthesis nodes with expanded gene doses and many glycoside hydrolase genes. Thus multiple genome pooling and horizontal genetic exchange, together with selective inheritance of lipid synthesis genes and species-specific gene loss, have led to the enormous genetic apparatus for oleaginousness and the wide genomic divergence among present-day Nannochloropsis. These findings have important implications in the screening and genetic engineering of microalgae for biofuels.
Human embryonic stem cells (hESCs) are an important source of stem cells in regenerative medicine, and much remains unknown about their molecular characteristics. To develop a detailed genomic profile of ESC lines in two different species, we compared transcriptomes of one murine and two different hESC lines by massively parallel signature sequencing (MPSS). Over 2 million signature tags from each line and their differentiating embryoid bodies were sequenced. Major differences and conserved similarities between species identified by MPSS were validated by reverse transcription polymerase chain reaction (RT-PCR) and microarray. The two hESC lines were similar overall, with differences that are attributable to alleles and propagation. Human-mouse comparisons, however, identified only a small (core) set of conserved genes that included genes known to be important in ESC biology, as well as additional novel genes. We identified major differences in leukemia inhibitory factor, transforming growth factor-beta, Wnt, and fibroblast growth factor signaling pathways, as well as in the expression of genes encoding metabolic, cytoskeletal, and matrix proteins, many of which were verified by RT-PCR or by comparison with published databases. The study reported here underscores the importance of cross-species comparisons and the versatility and sensitivity of MPSS as a powerful complement to current array technology.