2020
DOI: 10.1101/2020.01.22.915579
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

InStrain enables population genomic analysis from metagenomic data and rigorous detection of identical microbial strains

Abstract: Coexisting microbial cells of the same species often exhibit genetic differences that can affect phenotypes ranging from nutrient preference to pathogenicity. Here we present inStrain, a program that utilizes metagenomic paired reads to profile intra-population genetic diversity (microdiversity) across whole genomes and compare populations in a microdiversity-aware manner, dramatically increasing genomic comparison accuracy when benchmarked against existing methods. We use inStrain to profile >1,000 fecal meta… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
30
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 25 publications
(30 citation statements)
references
References 61 publications
0
30
0
Order By: Relevance
“…Recent FMT analyses have combined deep metagenomic sequencing with the computational detection of single nucleotide polymorphic (SNP) variants in common species marker genes to quantify some properties of strain composition. These approaches [30][31][32][33] have found enrichments of donor SNPs in the recipient microbiota post-FMT suggesting transmission and engraftment 31,32 of some proportion of the donor's strains in the recipient but linkage of these donor microbe SNPs to the donor's discrete bacterial strains remains elusive. While informative, these approaches require very deep metagenomic sequencing to track strains present at even shallow relative abundance 34 .…”
Section: Introductionmentioning
confidence: 99%
“…Recent FMT analyses have combined deep metagenomic sequencing with the computational detection of single nucleotide polymorphic (SNP) variants in common species marker genes to quantify some properties of strain composition. These approaches [30][31][32][33] have found enrichments of donor SNPs in the recipient microbiota post-FMT suggesting transmission and engraftment 31,32 of some proportion of the donor's strains in the recipient but linkage of these donor microbe SNPs to the donor's discrete bacterial strains remains elusive. While informative, these approaches require very deep metagenomic sequencing to track strains present at even shallow relative abundance 34 .…”
Section: Introductionmentioning
confidence: 99%
“…MAGs have led to the discovery of novel deepbranching lineages previously eluding cultivation-based approaches, such as the Asgardarchaeota [16] or the bacterial Candidate Phyla Radiation [11,17], thereby substantially expanding the microbial tree of life [18,19]. Moreover, MAGs can be taxonomically resolved to strain level [20,21] which is particularly beneficial in undersampled environments where reference genomic coverage is scarce [22,23].…”
Section: Introductionmentioning
confidence: 99%
“…These species boundaries can be detected in the SNPs of conserved genes (i.e., the average nucleotide identity; ANI) 3–6 and in the large differences in genome overlap (e.g., by pairwise genome alignment) or gene flow discontinuities driven by the strong bias of horizontal gene transfer within a species rather than across species boundaries 3,7–9 . As in microbial pathogenesis 10,11 , the functional impact of the microbiome is dependent on strain-level variation within a species 12–18 , which has driven computational advances to track strains 19–22 , cluster strains 23 , measure strain stability 7,21,24 , and analyze strain variation 25,26 . Strain-focused algorithms for both the commensal microbiome and infectious disease research have also begun to inform genomic boundaries for bacterial strains 7,21,22 .…”
Section: Introductionmentioning
confidence: 99%
“…As in microbial pathogenesis 10,11 , the functional impact of the microbiome is dependent on strain-level variation within a species 12–18 , which has driven computational advances to track strains 19–22 , cluster strains 23 , measure strain stability 7,21,24 , and analyze strain variation 25,26 . Strain-focused algorithms for both the commensal microbiome and infectious disease research have also begun to inform genomic boundaries for bacterial strains 7,21,22 . Despite the importance of strain-variation, we still lack a broad understanding of the general principles of strain population structure, such as the number of strains in each bacterial species, the stability of these strains 27 , the prevalence of each strain within a species in human and non-human reservoirs, and the fitness differences and environmental changes that drive alterations in strain prevalence 27,28 .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation