The human genome contains many thousands of long noncoding RNAs (lncRNAs). While several studies have demonstrated compelling biological and disease roles for individual examples, analytical and experimental approaches to investigate these genes have been hampered by the lack of comprehensive lncRNA annotation. Here, we present and analyze the most complete human lncRNA annotation to date, produced by the GENCODE consortium within the framework of the ENCODE project and comprising 9277 manually annotated genes producing 14,880 transcripts. Our analyses indicate that lncRNAs are generated through pathways similar to that of protein-coding genes, with similar histone-modification profiles, splicing signals, and exon/intron lengths. In contrast to protein-coding genes, however, lncRNAs display a striking bias toward two-exon transcripts, they are predominantly localized in the chromatin and nucleus, and a fraction appear to be preferentially processed into small RNAs. They are under stronger selective pressure than neutrally evolving sequences-particularly in their promoter regions, which display levels of selection comparable to protein-coding genes. Importantly, about one-third seem to have arisen within the primate lineage. Comprehensive analysis of their expression in multiple human organs and brain regions shows that lncRNAs are generally lower expressed than protein-coding genes, and display more tissue-specific expression patterns, with a large fraction of tissuespecific lncRNAs expressed in the brain. Expression correlation analysis indicates that lncRNAs show particularly striking positive correlation with the expression of antisense coding genes. This GENCODE annotation represents a valuable resource for future studies of lncRNAs.
SummaryAs the premier model organism in biomedical research, the laboratory mouse shares the majority of protein-coding genes with humans, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications, and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of other sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.
While the long non-coding RNAs (ncRNAs) constitute a large portion of the mammalian transcriptome, their biological functions has remained elusive. A few long ncRNAs that have been studied in any detail silence gene expression in processes such as X-inactivation and imprinting. We used a GENCODE annotation of the human genome to characterize over a thousand long ncRNAs that are expressed in multiple cell lines. Unexpectedly, we found an enhancer-like function for a set of these long ncRNAs in human cell lines. Depletion of a number of ncRNAs led to increased expression of their neighboring protein-coding genes, including the master regulator of hematopoiesis, SCL (also called TAL1), Snai1 and Snai2. Using heterologous transcription assays we demonstrated a requirement for the ncRNAs in mediating such enhancement of gene expression. These results reveal an unanticipated role for a class of long ncRNAs in activation of critical regulators of development and differentiation.
Aneuploidy is usually deleterious in multicellular organisms but appears to be tolerated and potentially beneficial in unicellular organisms, including pathogens. Leishmania, a major protozoan parasite, is emerging as a new model for aneuploidy, since in vitro-cultivated strains are highly aneuploid, with interstrain diversity and intrastrain mosaicism. The alternation of two life stages in different environments (extracellular promastigotes and intracellular amastigotes) offers a unique opportunity to study the impact of environment on aneuploidy and gene expression. We sequenced the whole genomes and transcriptomes of Leishmania donovani strains throughout their adaptation to in vivo conditions mimicking natural vertebrate and invertebrate host environments. The nucleotide sequences were almost unchanged within a strain, in contrast to highly variable aneuploidy. Although high in promastigotes in vitro, aneuploidy dropped significantly in hamster amastigotes, in a progressive and strain-specific manner, accompanied by the emergence of new polysomies. After a passage through a sand fly, smaller yet consistent karyotype changes were detected. Changes in chromosome copy numbers were correlated with the corresponding transcript levels, but additional aneuploidy-independent regulation of gene expression was observed. This affected stage-specific gene expression, downregulation of the entire chromosome 31, and upregulation of gene arrays on chromosomes 5 and 8. Aneuploidy changes in Leishmania are probably adaptive and exploited to modulate the dosage and expression of specific genes; they are well tolerated, but additional mechanisms may exist to regulate the transcript levels of other genes located on aneuploid chromosomes. Our model should allow studies of the impact of aneuploidy on molecular adaptations and cellular fitness.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.