Background: Microbiome research aims to identify microbes that play crucial roles in both health and disease, with the potential of these microbes to serve as biomarkers for preventing, diagnosing, and treating diseases. However, the presence of compositionality, sparsity, and over-dispersion presents formidable challenges for absolute abundance analysis, leading to potentially misleading results when classical data analysis methods are applied.
Results: To address these challenges, we introduced mbDecoda, a model-based approach for debiased analysis of sparse compositions of microbiomes. mbDecoda employs a zero-inflated negative binomial model, linking mean abundance to the variable of interest through a log link function, and it accommodates the adjustment for confounding factors. Through extensive simulation studies and analysis of real-world microbiome datasets, we demonstrated that mbDecoda compared favorably to state-of-the-art methods in terms of effectiveness, robustness, and reproducibility.
Conclusions: mbDecoda is an accurate and reliable method for absolute abundance analysis. It is expected to be a valuable contribution to the statistical toolbox for for analyzing and interpreting microbiome data.