The transmission of information from DNA to RNA is a critical process. We compared RNA sequences from human B cells of 27 individuals to the corresponding DNA sequences from the same individuals and uncovered more than 10,000 exonic sites where the RNA sequences do not match those of the DNA. All 12 possible categories of discordances were observed. These differences were nonrandom, as many sites were found in multiple individuals and in different cell types, including primary skin cells and brain tissues. Using mass spectrometry, we detected peptides that are translated from the discordant RNA sequences and thus do not correspond exactly to the DNA sequences. These widespread RNA-DNA differences in the human transcriptome provide an as yet unexplored aspect of genome variation.
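The figure of 12 categories follows from simple counting: each of the four DNA bases can be read as any of the three other bases in the RNA, giving 4 × 3 = 12 ordered mismatch types. The sketch below only illustrates that bookkeeping and is not the authors' pipeline. It assumes two already aligned sequences of equal length, and the names `CATEGORIES` and `classify_discordances` are hypothetical. A real analysis would also need read alignment, coverage and base-quality filters, and checks for sequencing or mapping artifacts.

```python
from collections import Counter
from itertools import permutations

BASES = "ACGT"

# Every ordered pair of distinct bases (DNA base > RNA base): 4 * 3 = 12.
CATEGORIES = [f"{d}>{r}" for d, r in permutations(BASES, 2)]


def classify_discordances(dna: str, rna: str) -> Counter:
    """Count mismatch categories between two aligned, equal-length sequences.

    Hypothetical helper for illustration: positions holding anything other
    than A, C, G, T (or U in the RNA) are skipped rather than counted.
    """
    if len(dna) != len(rna):
        raise ValueError("sequences must be aligned to the same length")
    counts = Counter()
    for d, r in zip(dna.upper(), rna.upper()):
        r = "T" if r == "U" else r  # compare the RNA in the DNA alphabet
        if d in BASES and r in BASES and d != r:
            counts[f"{d}>{r}"] += 1
    return counts


if __name__ == "__main__":
    print(len(CATEGORIES))
    print(classify_discordances("ACGTAC", "GCGTAU"))
```

In this toy input the first position is an A in the DNA and a G in the RNA, and the last is a C read as U, so the two sites fall into the `A>G` and `C>T` categories.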
SUMMARY
Adenosine deaminases acting on RNA (ADARs) convert adenosine to inosine, which is then recognized as guanosine. To study the role of ADAR proteins in RNA editing and gene regulation, we sequenced and compared the DNA and RNA of human B cells. Then, we followed up the findings experimentally with siRNA knockdown and RNA and protein immunoprecipitations. The results uncovered over 60,000 A-to-G editing sites and several thousand genes whose expression levels are influenced by ADARs. Of these ADAR targets, 90% were identified. Our results also reveal that ADAR regulates transcript stability and gene expression through interaction with HuR (ELAVL1). These findings extend the role of ADAR and show that it cooperates with other RNA-processing proteins to regulate the sequence and expression of transcripts in human cells.
Complex regulation of gene expression in mammals has evolved from simpler eukaryotic systems, yet the mechanistic features of this evolution remain elusive. Here, we compared the transcriptional landscapes of the distantly related budding and fission yeast. We adapted the Precision Run-On sequencing (PRO-seq) approach to map the positions of RNA polymerase active sites genome-wide in Schizosaccharomyces pombe and Saccharomyces cerevisiae. Additionally, we mapped preferred sites of transcription initiation in each organism using PRO-cap. Unexpectedly, we identify a pause in early elongation, specific to S. pombe, that requires the conserved elongation factor subunit Spt4 and resembles promoter-proximal pausing in metazoans. PRO-seq profiles in strains lacking Spt4 reveal globally elevated levels of transcribing RNA Polymerase II (Pol II) within genes in both species. Messenger RNA abundance, however, does not reflect the increases in Pol II density, indicating a global reduction in elongation rate. Together, our results provide the first base-pair resolution map of transcription elongation in S. pombe and identify divergent roles for Spt4 in controlling elongation in budding and fission yeast.
R-loops are three-stranded nucleic acid structures that are abundant yet often viewed as by-products of transcription. Studying cells from patients with a motor neuron disease (amyotrophic lateral sclerosis 4 [ALS4]) caused by a mutation in senataxin, we uncovered how R-loops promote transcription. In ALS4 patients, the senataxin mutation depletes R-loops with a consequent effect on gene expression. With fewer R-loops in ALS4 cells, the expression of BAMBI, a negative regulator of transforming growth factor β (TGF-β), is reduced; this in turn leads to the activation of the TGF-β pathway. We uncovered that genome-wide R-loops influence promoter methylation of over 1,200 human genes. DNA methyltransferase 1 favors binding to double-stranded DNA over R-loops. Thus, in forming R-loops, nascent RNA blocks DNA methylation and promotes further transcription. Hence, our results show that nucleic acid structures, in addition to sequences, influence the binding and activity of regulatory proteins.