In molecular evolutionary analyses, short DNA sequences are used to infer phylogenetic relationships among species. Here we apply this principle to the study of bacterial biosynthesis, enabling the targeted isolation of previously unidentified natural products directly from complex metagenomes. Our approach uses short natural product sequence tags derived from conserved biosynthetic motifs to profile biosynthetic diversity in the environment and then guide the recovery of gene clusters from metagenomic libraries. The methodology is conceptually simple, requires only a small investment in sequencing, and is not computationally demanding. To demonstrate the power of this approach to natural product discovery we conducted a computational search for epoxyketone proteasome inhibitors within 185 globally distributed soil metagenomes. This led to the identification of 99 unique epoxyketone sequence tags, falling into 6 phylogenetically distinct clades. Complete gene clusters associated with nine unique tags were recovered from four saturating soil metagenomic libraries. Using heterologous expression methodologies, seven potent epoxyketone proteasome inhibitors (clarepoxcins A-E and landepoxcins A and B) were produced from these pathways, including compounds with different warhead structures and a naturally occurring halohydrin prodrug. This study provides a template for the targeted expansion of bacterially derived natural products using the global metagenome.T he advent of cost-effective high-throughput sequencing and an increasingly sophisticated understanding of bacterial secondary metabolite biosynthesis have led to two important revelations with respect to the search for new natural products: first, that the biosynthetic potential of most cultured bacteria, as judged by the number of biosynthetic gene clusters (BGCs) observed in sequenced genomes, is far greater than previously estimated (1, 2); second, that the number of bacterial species in most environments is at least 100× greater than the number of species that is readily cultured (3, 4). These observations suggest that conventional "phenotype-first" natural products isolation approaches have only examined a small fraction of earth's bacterial biosynthetic potential.There are now a number of genomic search engines available that allow researchers to rapidly scan microbial whole genome sequences for BGCs encoding new natural products (5-7). Unfortunately, the large DNA contigs that these search strategies require as input are not readily available from complex metagenomes. In response to the need for a more robust metagenomic search strategy, our group recently developed an informatics platform called eSNaPD (8, 9) (environmental Surveyor of Natural Product Diversity) with the specific aim of facilitating sequence-guided discovery of new bacterial natural products from complex metagenomes (Fig. 1).The eSNaPD software is designed to bioinformatically assess short DNA sequences that have been amplified from environmental metagenomes by degenerate PCR targeting c...