G-quadruplexes (G4s) are nucleic acid secondary structures that form within guanine-rich DNA or RNA sequences. G4 formation can affect chromatin architecture and gene regulation and has been associated with genomic instability, genetic diseases and cancer progression. Here we present a high-resolution sequencing-based method to detect G4s in the human genome. We identified 716,310 distinct G4 structures, 451,646 of which were not predicted by computational methods. These included previously uncharacterized noncanonical long loop and bulged structures. We observed a high G4 density in functional regions, such as 5' untranslated regions and splicing sites, as well as in genes previously not predicted to contain these structures (such as BRCA2). G4 formation was significantly associated with oncogenes, tumor suppressors and somatic copy number alterations related to cancer development. The G4s identified in this study may therefore represent promising targets for cancer intervention.
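To illustrate why long loop and bulged structures can escape computational prediction, here is a minimal sketch of a motif-based search. The abstract does not name the prediction method, so this is an assumption: the sketch uses the widely used canonical pattern of four runs of at least three guanines separated by loops of 1 to 7 nucleotides. The function name `find_canonical_g4` and the example sequences are hypothetical and chosen only for illustration.

```python
import re

# Assumed canonical motif (not taken from the abstract): four runs of >= 3
# guanines separated by three loops of 1-7 nucleotides each.
CANONICAL_G4 = re.compile(
    r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}"
)


def find_canonical_g4(seq):
    """Return (start, matched_sequence) for each non-overlapping match.

    Only the strand given is scanned; a G4 on the opposite strand would
    appear here as runs of C and is not reported by this sketch.
    """
    seq = seq.upper()
    return [(m.start(), m.group(0)) for m in CANONICAL_G4.finditer(seq)]


# Hypothetical example sequences.
telomeric = "ATGGGTTAGGGTTAGGGTTAGGGCA"        # four GGG runs, 3-nt loops
long_loop = "GGG" + "T" * 12 + "GGGTGGGTGGG"   # first loop is 12 nt long
bulged = "GGAGTGGGTGGGTGGG"                    # first G-run interrupted by A

print(find_canonical_g4(telomeric))
print(find_canonical_g4(long_loop))
print(find_canonical_g4(bulged))
```

Under this assumed pattern, only the first sequence is reported. The long loop and bulged examples return no match, which is the kind of gap an experimental, sequencing-based map can fill.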
G-quadruplex (G4) structural motifs have been linked to transcription, replication and genome instability and are implicated in cancer and other diseases. However, it is crucial to demonstrate the bona fide formation of G4 structures within an endogenous chromatin context. Herein we address this through the development of G4 ChIP-seq, an antibody-based G4 chromatin immunoprecipitation and high-throughput sequencing approach. We find ∼10,000 G4 structures in human chromatin, predominantly in regulatory, nucleosome-depleted regions. G4 structures are enriched in the promoters and 5' UTRs of highly transcribed genes, particularly in genes related to cancer and in somatic copy number amplifications, such as MYC. Strikingly, de novo and enhanced G4 formation are associated with increased transcriptional activity, as shown by HDAC inhibitor-induced chromatin relaxation and observed in immortalized as compared to normal cellular states. Our findings show that regulatory, nucleosome-depleted chromatin and elevated transcription shape the endogenous human G4 DNA landscape.
Single-stranded guanine-rich DNA sequences can fold into four-stranded DNA structures called G-quadruplexes (G4s) that arise from the self-stacking of two or more guanine quartets. There has been considerable recent progress in the detection and mapping of G4 structures in the human genome and in biologically relevant contexts. These advancements, many of which align with predictions made previously in computational studies, provide important new insights into the functions of G4 structures in, for example, the regulation of transcription and genome stability, and uncover their potential relevance for cancer therapy.