Nanopore sequencing and phylodynamic modelling have been used to reconstruct the transmission dynamics of viral epidemics, but their application to bacterial pathogens has remained challenging. Here, we implement Random Forest models for single nucleotide polymorphism (SNP) polishing to estimate divergence and effective reproduction numbers (Re) of two community-associated, methicillin-resistant Staphylococcus aureus (MRSA) outbreaks in remote Far North Queensland and Papua New Guinea (n = 159). Successive bar-coded panels of S. aureus isolates (2 × 12 per MinION) sequenced at low-coverage (> 5x - 10x) provided sufficient data to accurately infer assembly genotypes with high recall when compared with Illumina references. De novo SNP calling with Clair was followed by SNP polishing using intra- and inter-species models trained on Snippy reference calls. Models achieved sufficient resolution on ST93 outbreak sequence types (> 70 - 90% accuracy and precision) for phylodynamic modelling from lineage-wide hybrid alignments and birth-death skyline models in BEAST2. Our method reproduced phylogenetic topology, geographical source of the outbreaks, and indications of sustained transmission (Re > 1). We provide Nextflow pipelines that implement SNP polisher training, evaluation, and outbreak alignments, enabling reconstruction of within-lineage transmission dynamics for infection control of bacterial disease outbreaks using nanopore sequencing.
Community-associated, methicillin-resistant Staphylococcus aureus (MRSA) lineages have emerged in many geographically distinct regions around the world during the past 30 years. Here, we apply consistent phylodynamic methods across multiple community-associated MRSA lineages to describe and contrast their patterns of emergence and dissemination. We generated whole genome sequencing data for the Australian sequence type (ST) 93-MRSA-IV from remote communities in Far North Queensland and Papua New Guinea, and the Bengal Bay ST772-MRSA-V clone from metropolitan communities in Pakistan. Increases in the effective reproduction number (Re) and sustained transmission (Re > 1) coincided with spread of progenitor methicillin-susceptible S. aureus (MSSA) in remote northern Australia, dissemination of the ST93-MRSA-IV geno-type into population centers on the Australian East Coast, and sub-sequent importation into the highlands of Papua New Guinea and Far North Queensland. Analysis of a ST772-MRSA-V cluster in Pakistan suggests that sustained transmission in the community following importation of resistant genotypes may be more common than previously thought. Applying the same phylodynamic methods to existing lineage datasets, we identified common signatures of epidemic growth in the emergence and epidemiological trajectory of community-associated S. aureus lineages from America, Asia, Australasia and Europe. Surges in Re were observed at the divergence of antibiotic resistant strains, coinciding with their establishment in regional population centers. Epidemic growth was also observed amongst drug-resistant MSSA clades in Africa and northern Australia. Our data suggest that the emergence of community-associated MRSA and MSSA lineages in the late 20th century was driven by a combination of antibiotic resistant genotypes and host epidemiology, leading to abrupt changes in lineage-wide transmission dynamics and sustained transmission in regional population centers.
Nanopore sequencing and phylodynamic modelling have been used to reconstruct the transmission dynamics of viral epidemics, but their application to bacterial pathogens has remained challenging. Cost-effective bacterial genome sequencing and variant calling on nanopore platforms would greatly enhance surveillance and outbreak response in communities without access to sequencing infrastructure. Here, we adapt random forest models for single nucleotide polymorphism (SNP) polishing developed by Sanderson and colleagues (2020) to estimate divergence and effective reproduction numbers (Re) of two methicillin-resistant Staphylococcus aureus (MRSA) outbreaks from remote communities in Far North Queensland and Papua New Guinea (n = 159). Successive barcoded panels of S. aureus isolates (2 x 12 per MinION) sequenced at low-coverage (> 5x - 10x) provided sufficient data to accurately infer genotypes with high recall when compared with Illumina references. Random forest models achieved high resolution on ST93 outbreak sequence types (> 90% accuracy and precision) and enabled phylodynamic inference of epidemiological parameters using birth death skyline models. Our method reproduced phylogenetic topology, origin of the outbreaks, and indications of epidemic growth (Re > 1). Nextflow pipelines implement SNP polisher training, evaluation, and outbreak alignments, enabling reconstruction of within-lineage transmission dynamics for infection control of bacterial disease outbreaks on portable nanopore platforms. Our study shows that nanopore technology can be used for bacterial outbreak reconstruction at competitive costs, providing opportunities for infection control in hospitals and communities without access to sequencing infrastructure, such as in remote northern Australia and Papua New Guinea.
Genomic neighbor typing enables heuristic inference of bacterial lineages and phenotypes from nanopore sequencing data. However, small reference databases may not be sufficiently representative of the diversity of lineages and genotypes present in a collection of isolates. In this study, we explore the use of genomic neighbor typing for surveillance of community-associated Staphylococcus aureus outbreaks in Papua New Guinea (PNG) and Far North Queensland, Australia (FNQ). We developed Sketchy, an implementation of genomic neighbor typing that queries exhaustive whole genome reference databases using MinHash. Evaluations were conducted using nanopore read simulations and six species-wide reference sketches (4832 - 47616 genomes), as well as two S. aureus outbreak data sets sequenced at low depth using a sequential multiplex library protocol on the MinION (n = 160, with matching Illumina data). Heuristic inference of lineages and antimicrobial resistance profiles allowed us to conduct multiplex genotyping in situ at the Papua New Guinea Institute of Medical Research in Goroka, on low-throughput Flongle adapters and using multiple successive libraries on the same MinION flow cell (n = 24 - 48). Comparison to phylogenetically informed genomic neighbor typing with RASE on the dominant outbreak sequence type suggests slightly better performance at predicting lineage-scale genotypes using large sketch sizes, but inferior performance in resolving clade-specific genotypes (methicillin resistance). Sketchy can be used for large-scale bacterial outbreak surveillance and in challenging sequencing scenarios, but improvements to clade-specific genotype inference are needed for diagnostic applications. Sketchy is available open-source at: https://github.com/esteinig/sketchy
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.