Massively-parallel cDNA sequencing has opened the way to deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here, we present the Trinity methodology for de novo full-length transcriptome reconstruction, and evaluate it on samples from fission yeast, mouse, and whitefly – an insect whose genome has not yet been sequenced. Trinity fully reconstructs a large fraction of the transcripts present in the data, also reporting alternative splice isoforms and transcripts from recently duplicated genes. In all cases, Trinity performs better than other available de novo transcriptome assembly programs, and its sensitivity is comparable to methods relying on genome alignments. Our approach provides a unified and general solution for transcriptome reconstruction in any sample, especially in the complete absence of a reference genome.
We describe Hi-C, a method that probes the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing. We constructed spatial proximity maps of the human genome with Hi-C at a resolution of 1Mb. These maps confirm the presence of chromosome territories and the spatial proximity of small, gene rich chromosomes. We identified an additional level of genome organization that is characterized by the spatial segregation of open and closed chromatin to form two genome-wide compartments. At the megabase scale, the chromatin conformation is consistent with a fractal globule, a knot-free conformation that enables maximally dense packing while preserving the ability to easily fold and
Hereditary haemochromatosis (HH), which affects some 1 in 400 and has an estimated carrier frequency of 1 in 10 individuals of Northern European descent, results in multi-organ dysfunction caused by increased iron deposition, and is treatable if detected early. Using linkage-disequilibrium and full haplotype analysis, we have identified a 250-kilobase region more than 3 megabases telomeric of the major histocompatibility complex (MHC) that is identical-by-descent in 85% of patient chromosomes. Within this region, we have identified a gene related to the MHC class I family, termed HLA-H, containing two missense alterations. One of these is predicted to inactivate this class of proteins and was found homozygous in 83% of 178 patients. A role of this gene in haemochromatosis is supported by the frequency and nature of the major mutation and prior studies implicating MHC class I-like proteins in iron metabolism.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.