Circular RNAs (circRNAs) have emerged as an important class of functional RNA molecules. Short-read RNA sequencing (RNA-seq) is a widely used strategy to identify circRNAs. However, an inherent limitation of short-read RNA-seq is that it does not experimentally determine the full-length sequences and exact exonic compositions of circRNAs. Here, we report isoCirc, a strategy for sequencing full-length circRNA isoforms, using rolling circle amplification followed by nanopore long-read sequencing. We describe an integrated computational pipeline to reliably characterize full-length circRNA isoforms using isoCirc data. Using isoCirc, we generate a comprehensive catalog of 107,147 full-length circRNA isoforms across 12 human tissues and one human cell line (HEK293), including 40,628 isoforms ≥500 nt in length. We identify widespread alternative splicing events within the internal part of circRNAs, including 720 retained intron events corresponding to a class of exon-intron circRNAs (EIciRNAs). Collectively, isoCirc and the companion dataset provide a useful strategy and resource for studying circRNAs in human transcriptomes.
A major question in human genetics is how sequence variants of broadly expressed genes produce tissue- and cell type-specific molecular phenotypes. Genetic variation of alternative splicing is a prevalent source of transcriptomic and proteomic diversity in human populations. We investigated splicing quantitative trait loci (sQTLs) in 1,209 samples from 13 human brain regions, using RNA sequencing (RNA-seq) and genotype data from the Genotype-Tissue Expression (GTEx) project. Hundreds of sQTLs were identified in each brain region. Some sQTLs were shared across brain regions, whereas others displayed regional specificity. These “regionally ubiquitous” and “regionally specific” sQTLs showed distinct positional distributions of single-nucleotide polymorphisms (SNPs) within and outside essential splice sites, respectively, suggesting their regulation by distinct molecular mechanisms. Integrating the binding motifs and expression patterns of RNA binding proteins with exon splicing profiles, we uncovered likely causal variants underlying brain region-specific sQTLs. Notably, SNP rs17651213 created a putative binding site for the splicing factor RBFOX2 and was associated with increased splicing of MAPT exon 3 in cerebellar tissues, where RBFOX2 was highly expressed. Overall, our study reveals a more comprehensive spectrum and regional variation of sQTLs in human brain and demonstrates that such regional variation can be used to fine map potential causal variants of sQTLs and their associated neurological diseases.
Long-read RNA sequencing (RNA-seq) holds great potential for characterizing transcriptome variation and full-length transcript isoforms, but the relatively high error rate of current long-read sequencing platforms poses a major challenge. We present ESPRESSO, a computational tool for robust discovery and quantification of transcript isoforms from error-prone long reads. ESPRESSO jointly considers alignments of all long reads aligned to a gene and uses error profiles of individual reads to improve the identification of splice junctions and the discovery of their corresponding transcript isoforms. On both a synthetic spike-in RNA sample and human RNA samples, ESPRESSO outperforms multiple contemporary tools in not only transcript isoform discovery but also transcript isoform quantification. In total, we generated and analyzed ~1.1 billion nanopore RNA-seq reads covering 30 human tissue samples and three human cell lines. ESPRESSO and its companion dataset provide a useful resource for studying the RNA repertoire of eukaryotic transcriptomes.
Cancer transcriptomes frequently exhibit RNA dysregulation. As the resulting aberrant transcripts may be translated into cancer-specific proteins, there is growing interest in exploiting RNA dysregulation as a source of tumor antigens (TAs) and thus novel immunotherapy targets. Recent advances in high-throughput technologies and rapid accumulation of multiomic cancer profiling data in public repositories have provided opportunities to systematically characterize RNA dysregulation in cancer and identify antigen targets for immunotherapy. However, given the complexity of cancer transcriptomes and proteomes, important conceptual and technological challenges exist. Here, we highlight the expanding repertoire of TAs arising from RNA dysregulation and introduce multiomic and big data strategies for identifying optimal immunotherapy targets. We discuss extant barriers for translating these targets into effective therapies as well as the implications for future research. RNA Dysregulation as a Source of Cancer Immunotherapy Targets Transcriptomic and proteomic outputs of human cells are controlled by multiple RNA-level regulatory processes. Mechanisms, such as pre-mRNA alternative splicing (AS, see Glossary) and RNA editing, can generate multiple protein isoforms from a single gene, greatly expanding the coding capacity and protein repertoire of human cells. Cancer cells exhibit widespread abnormalities in RNA processing, including driver alterations that functionally contribute to cancer development and progression [1,2]. Through prevalent RNA dysregulation, cancer cells can express a distinct set of transcripts and proteins, some of which may represent therapeutic targets.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.