Duplex stem-loops and four-stranded G-quadruplexes have been implicated in (patho)biological processes. Overlap of stem-loop- and quadruplex-forming sequences could give rise to quadruplex–duplex hybrids (QDH), which combine features of both structural forms and could exhibit unique properties. Here, we present a combined genomic and structural study of stem-loop-containing quadruplex sequences (SLQS) in the human genome. Based on a maximum loop length of 20 nt, our survey identified 80 307 SLQS, embedded within 60 172 unique clusters. Our analysis suggested that these should cover close to half of total SLQS in the entire genome. Among these, 48 508 SLQS were strand-specifically located in genic/promoter regions, with the majority of genes displaying a low number of SLQS. Notably, genes containing abundant SLQS clusters were strongly associated with brain tissues. Enrichment analysis of SLQS-positive genes and mapping of SLQS onto transcriptional/mutagenesis hotspots and cancer-associated genes, provided a statistical framework supporting the biological involvements of SLQS. In
vitro formation of diverse QDH by selective SLQS hits were successfully verified by nuclear magnetic resonance spectroscopy. Folding topologies of two SLQS were elucidated in detail. We also demonstrated that sequence changes at mutation/single-nucleotide polymorphism loci could affect the structural conformations adopted by SLQS. Thus, our predicted SLQS offer novel insights into the potential involvement of QDH in diverse (patho)biological processes and could represent novel regulatory signals.
Naturally occurring hydrazones are
rare despite the ubiquitous
usage of synthetic hydrazones in the preparation of organic compounds
and functional materials. In this study, we discovered a family of
novel microbial metabolites (tasikamides) that share a unique cyclic
pentapeptide scaffold. Surprisingly, tasikamides A–C (1–3) contain a hydrazone group (CNN)
that joins the cyclic peptide scaffold to an alkyl 5-hydroxylanthranilate
(AHA) moiety. We discovered that the biosynthesis of 1–3 requires two discrete gene clusters, with
one encoding a nonribosomal peptide synthetase (NRPS) pathway for
assembling the cyclic peptide scaffold and another encoding the AHA-synthesizing
pathway. The AHA gene cluster encodes three ancillary enzymes that
catalyze the diazotization of AHA to yield an aryl diazonium species
(diazo-AHA). The electrophilic diazo-AHA undergoes nonenzymatic Japp–Klingemann
coupling with a β-keto aldehyde-containing cyclic peptide precursor
to furnish the hydrazone group and yield 1–3. The studies together unraveled a novel mechanism whereby
specialized metabolites are formed by the coupling of two biosynthetic
pathways via an unprecedented in vivo Japp–Klingemann reaction. The findings raise the prospect
of exploiting the arylamine-diazotizing enzymes (AAD) for the in vivo synthesis of aryl compounds and modification of
biological macromolecules.
We report the genome-guided discovery of sungeidines, a class of microbial secondary metabolites with unique structural features. Despite evolutionary relationships with dynemicin-type enediynes, the sungeidines are produced by a biosynthetic gene cluster (BGC) that exhibits distinct differences from known enediyne BGCs. Our studies suggest that the sungeidines are assembled from two octaketide chains that are processed differently than those of the dynemicin-type enediynes. The biosynthesis also involves a unique activating sulfotransferase that promotes a dehydration reaction. The loss of genes, including a putative epoxidase gene, is likely to be the main cause of the divergence of the sungeidine pathway from other canonical enediyne pathways. The findings disclose the surprising evolvability of enediyne pathways and set the stage for characterizing the intriguing enzymatic steps in sungeidine biosynthesis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.