Alternative splicing is widely acknowledged to be a crucial regulator of gene expression and is a key contributor to both normal developmental processes and disease states. While cost-effective and accurate for quantification, short-read RNA-seq lacks the ability to resolve full-length transcript isoforms despite increasingly sophisticated computational methods. Long-read sequencing platforms such as Pacific Biosciences (PacBio) and Oxford Nanopore (ONT) bypass the transcript reconstruction challenges of short reads. Here we introduce TALON, the ENCODE4 pipeline for platform-independent analysis of long-read transcriptomes. We apply TALON to the GM12878 cell line and show that while both PacBio and ONT technologies perform well at full-transcript discovery and quantification, each displayed distinct technical artifacts. We further apply TALON to mouse hippocampus and cortex transcriptomes and find that 422 genes found in these regions have more reads associated with novel isoforms than with annotated ones. We demonstrate that TALON is a capable of tracking both known and novel transcript models as well as their expression levels across datasets for both simple studies and in larger projects. These properties will enable TALON users to move beyond the limitations of short-read data to perform isoform discovery and quantification in a uniform manner on existing and future long-read platforms.
The reconstruction of gene regulatory networks underlying cell differentiation from high-throughput gene expression and chromatin data remains a challenge. Here, we derive dynamic gene regulatory networks for human myeloid differentiation using a 5-day time-series of RNA-seq and ATAC-seq data. We profile HL-60 promyelocytes differentiating into macrophage, neutrophil, monocyte, and monocyte-derived macrophages. We find a rapid response in the expression of key transcription factors and lineage markers that only regulate a subset of their targets at a given time, which is followed by chromatin accessibility changes that occur later along with further gene expression changes. We observe differences between promyelocyte-derived and monocyte-derived macrophages at both the transcriptional and chromatin landscape level, despite using the same differentiation stimulus, which suggest that the path taken by cells in the differentiation landscape defines their end cell state. More generally, our approach of combining neighboring time points and replicates to achieve greater sequencing depth can efficiently infer footprint-based regulatory networks from long series data.
Between November 2021 and February 2022, SARS-CoV-2 Delta and Omicron variants co-circulated in the United States, allowing for co-infections and possible recombination events. We sequenced 29,719 positive samples during this period and analyzed the presence and fraction of reads supporting mutations specific to either the Delta or Omicron variant. Our sequencing protocol uses hybridization capture and is thus less subject to artifacts observed in amplicon-based approaches that may lead to spurious signals for recombinants. We identified 20 co-infections, one of which displayed evidence of a low recombinant viral population. We also identified two independent cases of infection by a Delta-Omicron recombinant virus, where 100% of the viral RNA came from one clonal recombinant. In both cases, the 5'-end of the viral genome was from the Delta genome, and the 3'-end from Omicron, though the breakpoints were different. Delta-Omicron recombinant viruses were rare, and there is currently no evidence that the two Delta-Omicron recombinant viruses identified are more transmissible between hosts compared to the circulating Omicron lineages.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.