2018
DOI: 10.1093/bioinformatics/bty830
|View full text |Cite
|
Sign up to set email alerts
|

MSPminer: abundance-based reconstitution of microbial pan-genomes from shotgun metagenomic data

Abstract: Motivation Analysis toolkits for shotgun metagenomic data achieve strain-level characterization of complex microbial communities by capturing intra-species gene content variation. Yet, these tools are hampered by the extent of reference genomes that are far from covering all microbial variability, as many species are still not sequenced or have only few strains available. Binning co-abundant genes obtained from de novo assembly is a powerful reference-free technique to d… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
87
0
2

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
3
2

Relationship

2
6

Authors

Journals

citations
Cited by 99 publications
(90 citation statements)
references
References 43 publications
1
87
0
2
Order By: Relevance
“…Divisive and threshold-free agglomerative approaches achieve finer taxonomic resolutions than the threshold-based similarity approach. Using WGS in the ecosystems where a bacterial gene catalog is available, such as the human gut or the pig gut (Xiao et al, 2016), the standard approach consists in mapping the reads against the catalog and then clustering the bacterial genes based on their abundance profiles to produce metagenomic species (MGS) (Nielsen et al, 2014) or clusters of coabundant genes to reconstruct microbial pan-genomes (MSP) (Plaza Oñate et al, 2018). We will refer to taxa, noting that the term can designate OTUs, ASVs, oligotypes, MGSs, MSPs and generally any feature found in abundance tables (obtained by counting the number of copies of each feature in each sample).…”
Section: Introductionmentioning
confidence: 99%
“…Divisive and threshold-free agglomerative approaches achieve finer taxonomic resolutions than the threshold-based similarity approach. Using WGS in the ecosystems where a bacterial gene catalog is available, such as the human gut or the pig gut (Xiao et al, 2016), the standard approach consists in mapping the reads against the catalog and then clustering the bacterial genes based on their abundance profiles to produce metagenomic species (MGS) (Nielsen et al, 2014) or clusters of coabundant genes to reconstruct microbial pan-genomes (MSP) (Plaza Oñate et al, 2018). We will refer to taxa, noting that the term can designate OTUs, ASVs, oligotypes, MGSs, MSPs and generally any feature found in abundance tables (obtained by counting the number of copies of each feature in each sample).…”
Section: Introductionmentioning
confidence: 99%
“…Next, whole-metagenomic sequencing was performed on 138 individuals (102 IBS patients and 36 healthy subjects) (Table 1). Metagenomic reads (with an average of 14 million reads per sample) were mapped onto a catalog of Metagenomic Species Pangenomes (MSPs) 27 , yielding a total of 1,661 MSPs. On the basis of per-individual genetic content, 166 of them were further divided into 523 subspecies, corresponding to a mean of 75.3% of the metagenome read mass.…”
Section: Resultsmentioning
confidence: 99%
“…Metagenomics species pangenomes (MSPs) are co-abundant gene groups that can be considered part of complete microbial species pangenomes. MSP gene content was extracted from a previous publication by Plaza-Onate et al 27 . MSP gene content was subdivided into core and accessory genes.…”
Section: Methodsmentioning
confidence: 99%
“…Finally, the Zeller MSP data originates from the same study as the Zeller data (Zeller et al, 2014). It was created from the shotgun data by reconstructing Metagenomics Species Pan-genomes (MSPs) abundance count table, as reported in Plaza Oñate et al (2018). Briefly, reads were quality-filtered and unique reads were mapped against the 9.9 million Integrated Gene Catalog (Li et al, 2014) using BBmap (Bushnell, 2014).…”
Section: Methodsmentioning
confidence: 99%
“…Divisive and threshold-free agglomerative approaches achieve finer taxonomic resolutions than the threshold-based similarity approach. Using WGS in the ecosystems where a bacterial gene catalog is available, such as the human gut (Li et al, 2014) or the pig gut (Xiao et al, 2016), the standard approach consists in mapping the reads against the catalog and then clustering the bacterial genes based on their abundance profiles to produce metagenomic species (MGS) (Nielsen et al, 2014) or clusters of co-abundant genes to reconstruct microbial pan-genomes (MSP) (Plaza Oñate et al, 2018). We will refer to taxa, noting that the term can designate OTUs, ASVs, oligotypes, MGSs, MSPs and generally any feature found in abundance tables.…”
Section: Introductionmentioning
confidence: 99%