In 2001, Celera Genomics and the International Human Genome Sequencing Consortium published their initial drafts of the human genome, which revolutionized the field of genomics. While these drafts and the updates that followed effectively covered the euchromatic fraction of the genome, the heterochromatin and many other complex regions were left unfinished or erroneous. Addressing this remaining 8% of the genome, the Telomere-to-Telomere (T2T) Consortium has finished the first truly complete 3.055 billion base pair (bp) sequence of a human genome, representing the largest improvement to the human reference genome since its initial release. The new T2T-CHM13 reference includes gapless assemblies for all 22 autosomes plus chromosome X, corrects numerous errors, and introduces nearly 200 million bp of novel sequence containing 2,226 paralogous gene copies, 115 of which are predicted to be protein coding. The newly completed regions include all centromeric satellite arrays and the short arms of all five acrocentric chromosomes, unlocking these complex regions of the genome to variational and functional studies for the first time.
Present workflows for producing human genome assemblies from long-read technologies have cost and production time bottlenecks that prohibit efficient scaling to large cohorts. We demonstrate an optimized PromethION nanopore sequencing method for eleven human genomes. The sequencing, performed on one machine in nine days, achieved an average 63x coverage, 42 Kb read N50, 90% median read identity and 6.5x coverage in 100 Kb+ reads using just three flow cells per sample. To assemble these data we introduce new computational tools: Shasta -a de novo long read assembler, and MarginPolish & HELENa suite of nanopore assembly polishing algorithms. On a single commercial compute node Shasta can produce a complete human genome assembly in under six hours, and MarginPolish & HELEN can polish the result in just over a day, achieving 99.9% identity (QV30) for haploid samples from nanopore reads alone. We evaluate assembly performance for diploid, haploid and trio-binned human samples in terms of accuracy, cost, and time and demonstrate improvements relative to current state-of-the-art methods in all areas. We further show that addition of proximity ligation (Hi-C) sequencing yields near chromosome-level scaffolds for all eleven genomes.
31Dynamic changes in chromatin accessibility coincide with important aspects of neuronal differentiation, such as 32 fate specification and arealization and confer cell type-specific associations to neurodevelopmental disorders. 33However, studies of the epigenomic landscape of the developing human brain have yet to be performed at single-34 cell resolution. Here, we profiled chromatin accessibility of >75,000 cells from eight distinct areas of developing 35human forebrain using single cell ATAC-seq (scATACseq). We identified thousands of loci that undergo 36 extensive cell type-specific changes in accessibility during corticogenesis. Chromatin state profiling also reveals 37 novel distinctions between neural progenitor cells from different cortical areas not seen in transcriptomic profiles 38 and suggests a role for retinoic acid signaling in cortical arealization. Comparison of the cell type-specific 39 chromatin landscape of cerebral organoids to primary developing cortex found that organoids establish broad 40 cell type-specific enhancer accessibility patterns similar to the developing cortex, but lack many putative 41 regulatory elements identified in homologous primary cell types. Together, our results reveal the important 42 contribution of chromatin state to the emerging patterns of cell type diversity and cell fate specification and 43 provide a blueprint for evaluating the fidelity and robustness of cerebral organoids as a model for cortical 44 development. 45 46Main text 47The diverse cell types of the human cerebral cortex (Fig. 1a) have been mostly classified based on a handful of 48 morphological, anatomical, and physiological features. Recent innovations in single cell genomics, such as single 49 cell mRNA sequencing (scRNA-seq), have enabled massively parallel profiling of thousands of molecular 50 features in every cell, uncovering the remarkable molecular diversity of cell types previously considered 51 homologous, such as excitatory neurons located in different areas of the cerebral cortex 1-6 . However, the 52 developmental mechanisms underlying the emergence of distinct cellular identities are largely unknown, as most 53 cortical neurons are generated at stages that are inaccessible to experimentation 5 . 54 55Over 60 years ago, Conrad Waddington introduced the concept of an epigenomic landscape to account for the 56 emergence of distinct cell fates 7 . In particular, chromatin state defines the functional architecture of the genome 57
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.