The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly.
SUMMARY The canonical Wnt/β-catenin signaling pathway governs diverse developmental, homeostatic and pathologic processes. Palmitoylated Wnt ligands engage cell surface Frizzled (Fzd) receptors and Lrp5/6 co-receptors enabling β-catenin nuclear translocation and Tcf/Lef-dependent gene transactivation1–3. Mutations in Wnt downstream signaling components have revealed diverse functions presumptively attributed to Wnt ligands themselves, although direct attribution remains elusive, as complicated by redundancy between 19 mammalian Wnts and 10 Fzds1 and Wnt hydrophobicity2,3. For example, individual Wnt ligand mutations have not revealed homeostatic phenotypes in the intestinal epithelium4, an archetypal canonical Wnt pathway-dependent rapidly self-renewing tissue whose regeneration is fueled by proliferative crypt Lgr5+ intestinal stem cells (ISCs)5–9. R-spondin ligands (Rspo1–4) engage distinct Lgr4-6 and Rnf43/Znrf3 receptor classes10–13, markedly potentiate canonical Wnt/β-catenin signaling and induce intestinal organoid growth in vitro and Lgr5+ ISCs in vivo8,14–17. However, the interchangeability, functional cooperation and relative contributions of Wnt versus Rspo ligands to in vivo canonical Wnt signaling and ISC biology remain unknown. Here, we deconstructed functional roles of Wnt versus Rspo ligands in the intestinal crypt stem cell niche. We demonstrate that the default fate of Lgr5+ ISCs is lineage commitment, escape from which requires both Rspo and Wnt ligands. However, gain-of-function studies using Rspo versus a novel non-lipidated Wnt analog reveal qualitatively distinct, non-interchangeable roles for these ligands in ISCs. Wnts are insufficient to induce Lgr5+ ISC self-renewal, but rather confer a basal competency by maintaining Rspo receptor expression that enables Rspo to actively drive and specify the extent of stem cell expansion. This functionally non-equivalent yet cooperative interplay between Wnt and Rspo ligands establishes a molecular precedent for regulation of mammalian stem cells by distinct priming and self-renewal factors, with broad implications for precision control of tissue regeneration.
The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCodeTM WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly.
Large-scale population based analyses coupled with advances in technology have demonstrated that the human genome is more diverse than originally thought. To date, this diversity has largely been uncovered using short read whole genome sequencing. However, standard short-read approaches, used primarily due to accuracy, throughput and costs, fail to give a complete picture of a genome. They struggle to identify large, balanced structural events, cannot access repetitive regions of the genome and fail to resolve the human genome into its two haplotypes. Here we describe an approach that retains long range information while harnessing the advantages of short reads. Starting from only~ ng of DNA, we produce barcoded short read libraries. The use of novel informatic approaches allows for the barcoded short reads to be associated with the long molecules of origin producing a novel datatype known as 'Linked-Reads'. This approach allows for simultaneous detection of small and large variants from a single Linked-Read library. We have previously demonstrated the utility of whole genome Linked-Reads (lrWGS) for performing diploid, de novo assembly of individual genomes (Weisenfeld et al. ). In this manuscript, weshow the advantages of Linked-Reads over standard short read approaches for reference based analysis. We demonstrate the ability of Linked-Reads to reconstruct megabase scale haplotypes and to recover parts of the genome that are typically inaccessible to short reads, including phenotypically important genes such as STRC, SMN and SMN . We demonstrate the ability of both lrWGS and Linked-Read Whole Exome Sequencing (lrWES) to identify complex structural variations, including balanced events, single exon deletions, and single exon duplications. The data presented here show that Linked-Reads provide a scalable approach for comprehensive genome analysis that is not possible using short reads alone.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.