Anthrax, caused by the bacterium Bacillus anthracis, is a disease of historical and current importance that is found throughout the world. The basis of its historical transmission is anecdotal and its true global population structure has remained largely cryptic. Seven diverse B. anthracis strains were whole-genome sequenced to identify rare single nucleotide polymorphisms (SNPs), followed by phylogenetic reconstruction of these characters onto an evolutionary model. This analysis identified SNPs that define the major clonal lineages within the species. These SNPs, in concert with 15 variable number tandem repeat (VNTR) markers, were used to subtype a collection of 1,033 B. anthracis isolates from 42 countries to create an extensive genotype data set. These analyses subdivided the isolates into three previously recognized major lineages (A, B, and C), with further subdivision into 12 clonal sub-lineages or sub-groups and, finally, 221 unique MLVA15 genotypes. This rare genomic variation was used to document the evolutionary progression of B. anthracis and to establish global patterns of diversity. Isolates in the A lineage are widely dispersed globally, whereas the B and C lineages occur on more restricted spatial scales. Molecular clock models based upon genome-wide synonymous substitutions indicate there was a massive radiation of the A lineage that occurred in the mid-Holocene (3,064–6,127 ybp). On more recent temporal scales, the global population structure of B. anthracis reflects colonial-era importation of specific genotypes from the Old World into the New World, as well as the repeated industrial importation of diverse genotypes into developed countries via spore-contaminated animal products. These findings indicate humans have played an important role in the evolution of anthrax by increasing the proliferation and dispersal of this now global disease. Finally, the value of global genotypic analysis for investigating bioterrorist-mediated outbreaks of anthrax is demonstrated.
Phylogenetic reconstruction using molecular data is often subject to homoplasy, leading to inaccurate conclusions about phylogenetic relationships among operational taxonomic units. Compared with other molecular markers, single-nucleotide polymorphisms (SNPs) exhibit extremely low mutation rates, making them rare in recently emerged pathogens, but they are less prone to homoplasy and thus extremely valuable for phylogenetic analyses. Despite their phylogenetic potential, ascertainment bias occurs when SNP characters are discovered through biased taxonomic sampling; by using whole-genome comparisons of five diverse strains of Bacillus anthracis to facilitate SNP discovery, we show that only polymorphisms lying along the evolutionary pathway between reference strains will be observed. We illustrate this in theoretical and simulated data sets in which complex phylogenetic topologies are reduced to linear evolutionary models. Using a set of 990 SNP markers, we also show how divergent branches in our topologies collapse to single points but provide accurate information on internodal distances and points of origin for ancestral clades. These data allowed us to determine the ancestral root of B. anthracis, showing that it lies closer to a newly described ''C'' branch than to either of two previously described ''A'' or ''B'' branches. In addition, subclade rooting of the C branch revealed unequal evolutionary rates that seem to be correlated with ecological parameters and strain attributes. Our use of nonhomoplastic whole-genome SNP characters allows branch points and clade membership to be estimated with great precision, providing greater insight into epidemiological, ecological, and forensic questions. T he discovery method used to find nonhomoplastic molecular characters can have a great impact on subsequent phylogenetic analyses (1). ''Discovery bias'' will most likely be encountered when polymorphic characters are rare and extraordinary means (i.e., whole-genome sequencing) are necessary to find them. In such cases, only a small subset of the target organisms can be exhaustively examined for genetic differences. When these polymorphisms are assayed across a more extensive and diverse panel of strains, the phylogenetic topology will be impacted greatly by the original choices of strains used for character discovery. Consider a six-taxon dendrogram (Fig. 1) in which nonhomoplastic polymorphic characters are discovered by exhaustive comparisons of two taxa. Mutational events that create character differences on the evolutionary path connecting the two taxa will be discovered, but characters on tangential (i.e., secondary) branches will not. Subsequent character-state determination in all taxa will accurately place them onto a linear phylogeny but will underrepresent the taxonomic complexity by eliminating secondary branching. If the initial character discovery is between two closely related taxa, large diverse clades will be reduced to a single point on a short linear dendrogram (Fig. 1B). This theoretical model assu...
Before the anthrax letter attacks of 2001, the developing field of microbial forensics relied on microbial genotyping schemes based on a small portion of a genome sequence. Amerithrax, the investigation into the anthrax letter attacks, applied high-resolution whole-genome sequencing and comparative genomics to identify key genetic features of the letters' Bacillus anthracis Ames strain. During systematic microbiological analysis of the spore material from the letters, we identified a number of morphological variants based on phenotypic characteristics and the ability to sporulate. The genomes of these morphological variants were sequenced and compared with that of the B. anthracis Ames ancestor, the progenitor of all B. anthracis Ames strains. Through comparative genomics, we identified four distinct loci with verifiable genetic mutations. Three of the four mutations could be directly linked to sporulation pathways in B. anthracis and more specifically to the regulation of the phosphorylation state of Spo0F, a key regulatory protein in the initiation of the sporulation cascade, thus linking phenotype to genotype. None of these variant genotypes were identified in single-colony environmental B. anthracis Ames isolates associated with the investigation. These genotypes were identified only in B. anthracis morphotypes isolated from the letters, indicating that the variants were not prevalent in the environment, not even the environments associated with the investigation. This study demonstrates the forensic value of systematic microbiological analysis combined with whole-genome sequencing and comparative genomics.
Highly precise diagnostics and forensic assays can be developed through a combination of evolutionary analysis and the exhaustive examination of genomic sequences. In Bacillus anthracis, whole-genome sequencing efforts revealed ca. 3,500 single-nucleotide polymorphisms (SNPs) among eight different strains and evolutionary analysis provides the identification of canonical SNPs. We have previously shown that SNPs are highly evolutionarily stable, and the clonal nature of B. anthracis makes them ideal signatures for subtyping this pathogen. Here we identified SNPs that define the lineage of B. anthracis that contains the Ames strain, the strain used in the 2001 bioterrorist attacks in the United States. Sequencing and real-time PCR were used to validate these SNPs across B. anthracis strains, including (i) 88 globally and genetically diverse isolates; (ii) isolates that were shown to be genetic relatives of the Ames strain by multiple-locus variable number tandem repeat analysis (MLVA); and (iii) several different lab stocks of the Ames strain, including a clinical isolate from the 2001 letter attack. Six SNPs were found to be highly specific for the Ames strain; four on the chromosome, one on the pX01 plasmid, and one on the pX02 plasmid. All six SNPs differentiated the B. anthracis Ames strain from the 88 unique B. anthracis strains, while five of the six separated Ames from its close genetic relatives. The use of these SNPs coupled with real-time PCR allows specific and sensitive (<100 fg of template DNA) identification of the Ames strain. This evolutionary and genomics-based approach provides an effective means for the discovery of strain-specific SNPs in B. anthracis.The 2001 anthrax letter attack illustrated the "real-world" efficacy of Bacillus anthracis as a bioterror agent. Forensic and epidemiological analysis of clinical samples and weaponized spores from the letter attack included the identification of the B. anthracis strain as Ames (3,8,17). This was initially accomplished using multiple-locus variable number tandem repeat analysis (MLVA) (3,6,8), which was one of the few technical approaches possible for this highly monomorphic pathogen (5). Identifying the strain and establishing its identity in attack locations were important in linking the dispersed anthrax cases and suggesting a possible source for the weaponized material. During the anthrax attack crisis, diagnostic speed, specificity, and sensitivity were often limiting factors, necessitating extraordinary efforts by public health officials and forensic labs (3). Further advancement of molecular diagnostics allows for more-efficient responses in disease outbreaks, whether natural or bioterrorist mediated.Single-nucleotide polymorphisms (SNPs) are increasingly recognized as important markers for detecting and subtyping bacterial pathogens, including B. anthracis (1,2,4,8,16,18,19). Recent comparative full-genome sequencing allowed the discovery of about 3,500 SNPs among eight strains of B. anthracis (15, 17; J. Ravel, unpublished data) and repre...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.