Colibactin is a gut microbiome metabolite of unknown structure that has been implicated in colorectal cancer formation. Several studies now suggest that the tumorgenicity of colibactin derives from interstrand cross-linking of host DNA. Here we use a combination of genetics, isotope labeling, tandem MS, and chemical synthesis to deduce the structure of colibactin. Our structural assignment accounts for all known biosynthetic data and suggests roles 5 for the final unaccounted enzymes in the colibactin gene cluster. DNA cross-link degradation products derived from synthetic and natural colibactin were indistinguishable by tandem MS analysis, thereby confirming the structure prediction. This work reveals the structure of colibactin, which has remained incompletely defined for over a decade.
Main Text: 10Introduction.Colibactin is the product of the clb (also referred to as pks) biosynthetic gene cluster (BGC) that is commonly present within E. coli found in the human colon (1, 2). Intensive interest in colibactin has been fueled by reports that clb + E. coli induce DNA damage in eukaryotic cells in vitro (3) and in vivo (4), promote tumor formation in mouse models of 15 colorectal cancer (CRC) (5-7), and are more prevalent in CRC patients than healthy subjects (5, 8). However, despite intensive efforts, the structure of colibactin has remained unknown for over a decade (9-13).Traditionally, microbial natural products have been produced by large-scale fermentation of bacterial cultures, discovered via activity-guided isolation, and their structures determined by 20 well-established chemical spectroscopic methods. However, because colibactin has been recalcitrant to these classical isolation techniques, our knowledge of its structure and biological activity is derived from diverse interdisciplinary findings. Evaluating the biological effects of 3 clb + bacterial strains has revealed colibactin induces DNA damage in eukaryotic cells.Enzymology, bioinformatic analysis of the clb BGC, stable isotope feeding experiments, characterization of biosynthetic intermediates, and gene deletion and editing studies have given insights into many elements of colibactin's biosynthesis, biological activity, and cellular trafficking. Additionally, by employing chemical synthesis to access shunt metabolites and 5 putative biosynthetic intermediates, a mechanism of action model has been developed that defines the key structural elements of colibactin underling its DNA-damaging properties.Merging of this data forms a picture, albeit incomplete, of colibactin's biosynthesis, structure, and mode of genotoxicity. Colibactin is assembled in a linear prodrug form referred to as precolibactin (see 1, Fig. 1). Key structural elements of precolibactins include a terminal N-10 myristoyl-D-Asn amide (blue in 1) (14-16) and an aminocyclopropane residue (green in 1) (17)(18)(19). The terminal amide is cleaved in the periplasm by the pathway-dedicated serine protease, colibactin peptidase (ClbP) (20,21). The resulting amine 2 undergoes a series of s...