2021
DOI: 10.1186/s13059-021-02306-1
|View full text |Cite
|
Sign up to set email alerts
|

Microbiome meta-analysis and cross-disease comparison enabled by the SIAMCAT machine learning toolbox

Abstract: The human microbiome is increasingly mined for diagnostic and therapeutic biomarkers using machine learning (ML). However, metagenomics-specific software is scarce, and overoptimistic evaluation and limited cross-study generalization are prevailing issues. To address these, we developed SIAMCAT, a versatile R toolbox for ML-based comparative metagenomics. We demonstrate its capabilities in a meta-analysis of fecal metagenomic studies (10,803 samples). When naively transferred across studies, ML models lost acc… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

5
160
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 168 publications
(165 citation statements)
references
References 109 publications
5
160
0
Order By: Relevance
“…Differential abundance between time points and statistical analysis of the changes over time was obtained with siamcat [ 36 ]. Time points analysed were at consecutive weeks from the start ( t 0): t 0, t 2, t 4, t 6, t 8 and t 10.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Differential abundance between time points and statistical analysis of the changes over time was obtained with siamcat [ 36 ]. Time points analysed were at consecutive weeks from the start ( t 0): t 0, t 2, t 4, t 6, t 8 and t 10.…”
Section: Resultsmentioning
confidence: 99%
“…siamcat [ 36 ] was employed to determine differentially abundant GTDB clustered MAGs between groups. The data were normalized by proportions prior to analysis with siamcat .…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Differential abundance analysis was performed by Wilcoxon rank sum test and Analysis of Composition of Microbiomes ( ANCOM ) [32] . Random forest machine learning models were performed using the randomForest package ( SIAMCAT package [33] ) in which five-fold cross-validation was performed with 100 iterations. Microbiome regression analyses were performed using the MaAsLin2 package.…”
Section: Methodsmentioning
confidence: 99%
“…To generate the machine learning models, taxonomic features were used in the form of ASV tables collapsed to specieslevel. For the classification, we used the workflow provided in "Siamcat, " which provides a machine learning toolbox for metagenome analysis through state-of-the-art machine learning methods (Wirbel et al, 2019(Wirbel et al, , 2021. The data were separately processed for fungi and bacteria and sample-wise relative abundance for the microbiome quantitative profiles was used as input data to maintain the uniformity.…”
Section: Machine Learning Analysismentioning
confidence: 99%