SummaryWe present a molecular dissection of pause site-dependent transcriptional termination for mammalian RNA polymerase II (Pol II)-transcribed genes. We show that nascent transcripts form RNA/DNA hybrid structures (R-loops) behind elongating Pol II and are especially prevalent over G-rich pause sites positioned downstream of gene poly(A) signals. Senataxin, a helicase protein associated with AOA2/ALS4 neurodegenerative disorders, acts to resolve these R-loop structures and by so doing allows access of the 5′–3′ exonuclease Xrn2 at 3′ cleavage poly(A) sites. This affords 3′ transcript degradation and consequent Pol II termination. In effect, R-loops formed over G-rich pause sites, followed by their resolution by senataxin, are key steps in the termination process.
SummaryThe mechanisms contributing to transcription-associated genomic instability are both complex and incompletely understood. Although R-loops are normal transcriptional intermediates, they are also associated with genomic instability. Here, we show that BRCA1 is recruited to R-loops that form normally over a subset of transcription termination regions. There it mediates the recruitment of a specific, physiological binding partner, senataxin (SETX). Disruption of this complex led to R-loop-driven DNA damage at those loci as reflected by adjacent γ-H2AX accumulation and ssDNA breaks within the untranscribed strand of relevant R-loop structures. Genome-wide analysis revealed widespread BRCA1 binding enrichment at R-loop-rich termination regions (TRs) of actively transcribed genes. Strikingly, within some of these genes in BRCA1 null breast tumors, there are specific insertion/deletion mutations located close to R-loop-mediated BRCA1 binding sites within TRs. Thus, BRCA1/SETX complexes support a DNA repair mechanism that addresses R-loop-based DNA damage at transcriptional pause sites.
R loops are three-stranded nucleic acid structures that comprise nascent RNA hybridized with the DNA template, leaving the nontemplate DNA single-stranded. R loops form naturally during transcription even though their persistent formation can be a risky outcome with deleterious effects on genome integrity. On the other hand, over the last few years, an increasingly strong case has been built for R loops as potential regulators of gene expression. Therefore, understanding their function and regulation under these opposite situations is essential to fully characterize the mechanisms that control genome integrity and gene expression. Here we review recent findings about these interesting structures that highlight their opposite roles in cellular fitness.
The formation of R-loops is a natural consequence of the transcription process, caused by invasion of the DNA duplex by nascent transcripts. These structures have been considered rare transcriptional by-products with potential harmful effects on genome integrity, due to the fragility of the displaced DNA coding strand1. However R-loops may also possess beneficial effects as their widespread formation has been detected over CpG island promoters in human genes2,3. Furthermore we have previously shown that R-loops are particularly enriched over G-rich terminator elements. These facilitate RNA polymerase II (Pol II) pausing prior to efficient termination4. Here we reveal an unanticipated link between R-loops and RNA interference (RNAi)-dependent H3K9me2 formation over pause site termination regions of mammalian protein coding genes. We show that R-loops induce antisense transcription over these pause elements which in turn lead to the generation of double-strand RNA (dsRNA) and recruitment of Dicer, Ago1, Ago2, and G9a histone lysine methyltransferase (HKMT). Consequently an H3K9me2 repressive mark is formed and Heterochromatin Protein 1γ (HP1γ) is recruited, that reinforces Pol II pausing prior to efficient transcriptional termination. We predict that R-loops promote a chromatin architecture that defines the termination region for a substantial subset of mammalian genes.
Roles for long noncoding RNAs (lncRNAs) in gene expression are emerging, but regulation of the lncRNA itself is poorly understood. We have identified a homeodomain protein, AtNDX, that regulates COOLAIR, a set of antisense transcripts originating from the 3' end of Arabidopsis FLOWERING LOCUS C (FLC). AtNDX associates with single-stranded DNA rather than doublestranded DNA non-sequence-specifically in vitro, and localizes to a heterochromatic region in the COOLAIR promoter in vivo. Single-stranded DNA was detected in vivo as part of an RNA-DNA hybrid, or R-loop, that covers the COOLAIR promoter. R-loop stabilization mediated by AtNDX inhibits COOLAIR transcription, which in turn modifies FLC expression. Differential stabilization of R-loops could be a general mechanism influencing gene expression in many organisms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.