2024
DOI: 10.1101/2024.04.10.588779
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Accurate estimation of intraspecific microbial gene content variation in metagenomic data with MIDAS v3 and StrainPGC

Byron J. Smith,
Chunyu Zhao,
Veronika Dubinkina
et al.

Abstract: Metagenomics has greatly expanded our understanding of the gut microbiome by revealing vast diversity within and across human hosts. Even within a single species, different strains can have highly divergent gene content, affecting traits such as antibiotic resistance, metabolism, and virulence. Methods that harness metagenomic data to resolve strain-level differences in functional potential are crucial for understanding the causes and consequences of this intraspecific diversity. The enormous size of pangenome… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(7 citation statements)
references
References 50 publications
0
7
0
Order By: Relevance
“…To perform pangenome profiling, we utilized the Genes module from MIDAS v3 [11], which features careful curation of the pangenome database and comprehensive functional annotation. Specifically, a single Bowtie2 index was built for all 71 species, and QC-ed paired-end reads for each sample were aligned to this index.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…To perform pangenome profiling, we utilized the Genes module from MIDAS v3 [11], which features careful curation of the pangenome database and comprehensive functional annotation. Specifically, a single Bowtie2 index was built for all 71 species, and QC-ed paired-end reads for each sample were aligned to this index.…”
Section: Methodsmentioning
confidence: 99%
“…MicroSLAM implements a generalized linear mixed-effects model that enables two complementary statistical tests of association between a host trait and within-species genetic variation (Fig 1A;Methods). Both tests use the presence/absence of genes from a given species' pangenome across samples, which can be quantified from metagenomic sequencing data using tools such as MIDAS v3, panX, and Roary [11][12][13] . The inputs are a gene presence/absence matrix, any covariates one wishes to include, and the trait data, which can be quantitative or binary (e.g., case/control).…”
Section: Microslam Modeling Approachmentioning
confidence: 99%
See 3 more Smart Citations