Transcriptomic analyses have identified tens of thousands of intergenic, intronic, and cis-antisense long noncoding RNAs (lncRNAs) that are expressed from mammalian genomes. Despite progress in functional characterization, little is known about the post-transcriptional regulation of lncRNAs and their half-lives. Although many are easily detectable by a variety of techniques, it has been assumed that lncRNAs are generally unstable, but this has not been examined genome-wide. Utilizing a custom noncoding RNA array, we determined the half-lives of~800 lncRNAs and~12,000 mRNAs in the mouse Neuro-2a cell line. We find only a minority of lncRNAs are unstable. LncRNA half-lives vary over a wide range, comparable to, although on average less than, that of mRNAs, suggestive of complex metabolism and widespread functionality. Combining half-lives with comprehensive lncRNA annotations identified hundreds of unstable (half-life < 2 h) intergenic, cis-antisense, and intronic lncRNAs, as well as lncRNAs showing extreme stability (half-life > 16 h). Analysis of lncRNA features revealed that intergenic and cis-antisense RNAs are more stable than those derived from introns, as are spliced lncRNAs compared to unspliced (single exon) transcripts. Subcellular localization of lncRNAs indicated widespread trafficking to different cellular locations, with nuclear-localized lncRNAs more likely to be unstable. Surprisingly, one of the least stable lncRNAs is the well-characterized paraspeckle RNA Neat1, suggesting Neat1 instability contributes to the dynamic nature of this subnuclear domain. We have created an online interactive resource (http://stability. matticklab.com) that allows easy navigation of lncRNA and mRNA stability profiles and provides a comprehensive annotation of~7200 mouse lncRNAs.
Large numbers of long RNAs with little or no protein-coding potential [long noncoding RNAs (lncRNAs)] are being identified in eukaryotes. In parallel, increasing data describing the expression profiles, molecular features and functions of individual lncRNAs in a variety of systems are accumulating. To enable the systematic compilation and updating of this information, we have developed a database (lncRNAdb) containing a comprehensive list of lncRNAs that have been shown to have, or to be associated with, biological functions in eukaryotes, as well as messenger RNAs that have regulatory roles. Each entry contains referenced information about the RNA, including sequences, structural information, genomic context, expression, subcellular localization, conservation, functional evidence and other relevant information. lncRNAdb can be searched by querying published RNA names and aliases, sequences, species and associated protein-coding genes, as well as terms contained in the annotations, such as the tissues in which the transcripts are expressed and associated diseases. In addition, lncRNAdb is linked to the UCSC Genome Browser for visualization and Noncoding RNA Expression Database (NRED) for expression information from a variety of sources. lncRNAdb provides a platform for the ongoing collation of the literature pertaining to lncRNAs and their association with other genomic elements. lncRNAdb can be accessed at: http://www.lncrnadb.org/.
During the splicing reaction, the 59 intron end is joined to the branchpoint nucleotide, selecting the next exon to incorporate into the mature RNA and forming an intron lariat, which is excised. Despite a critical role in gene splicing, the locations and features of human splicing branchpoints are largely unknown. We use exoribonuclease digestion and targeted RNA-sequencing to enrich for sequences that traverse the lariat junction and, by split and inverted alignment, reveal the branchpoint. We identify 59,359 high-confidence human branchpoints in >10,000 genes, providing a first map of splicing branchpoints in the human genome. Branchpoints are predominantly adenosine, highly conserved, and closely distributed to the 39 splice site. Analysis of human branchpoints reveals numerous novel features, including distinct features of branchpoints for alternatively spliced exons and a family of conserved sequence motifs overlapping branchpoints we term B-boxes, which exhibit maximal nucleotide diversity while maintaining interactions with the keto-rich U2 snRNA. Different B-box motifs exhibit divergent usage in vertebrate lineages and associate with other splicing elements and distinct intron-exon architectures, suggesting integration within a broader regulatory splicing code. Lastly, although branchpoints are refractory to common mutational processes and genetic variation, mutations occurring at branchpoint nucleotides are enriched for disease associations.[Supplemental material is available for this article.]The majority of human genes are spliced, a process whereby introns are removed from the nascent RNA and the remaining exonic sequence joined together into a mature RNA transcript. In addition, alternative splicing generates complex networks of isoforms from human gene loci and plays a major role in shaping the diversity of the transcriptome (Kapranov et al. 2005;Gerstein et al. 2007;Djebali et al. 2012).Splicing occurs in the spliceosome, a large ribonucleoprotein complex that recognizes at least three genetic elements within each intron: the 59 splice site (59SS), the 39 splice site (39SS), and the branchpoint (Will and L€ uhrmann 2011). RNU2-1, the U2 spliceosomal RNA (snRNA) base pairs to the sequence surrounding the unpaired branchpoint nucleotide, which then undergoes transesterification with the 59 end of the intron to form a closed lariat structure. The spliceosome then scans for the downstream 39 splice site, which undergoes a second trans-esterification reaction to join together the two exon ends and excise the intron lariat (Fig.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.