2018
DOI: 10.1101/406264
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Shrinkage improves estimation of microbial associations under different normalization methods

Abstract: Consistent estimation of associations in microbial genomic survey count data is fundamental to microbiome research. Technical limitations, including compositionality, low sample sizes, and technical variability, obstruct standard application of association measures and require data normalization prior to estimating associations. Here, we investigate the interplay between data normalization and microbial association estimation by a comprehensive analysis of statistical consistency. Leveraging the large sample s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
3
1

Relationship

2
5

Authors

Journals

citations
Cited by 16 publications
(10 citation statements)
references
References 55 publications
0
10
0
Order By: Relevance
“…However, such data require spike-ins and are currently rarely available. Badri et al (2018) have investigated normalization strategies and their effect in correlation analysis but for a single time point, while Metwally et al (2018) proposed three normalization strategies that ignore the compositionality data problem. No method for longitudinal compositional data analysis has been proposed as yet.…”
Section: Discussionmentioning
confidence: 99%
“…However, such data require spike-ins and are currently rarely available. Badri et al (2018) have investigated normalization strategies and their effect in correlation analysis but for a single time point, while Metwally et al (2018) proposed three normalization strategies that ignore the compositionality data problem. No method for longitudinal compositional data analysis has been proposed as yet.…”
Section: Discussionmentioning
confidence: 99%
“…As our experience with anaerobic digestion (see Appendix B) demonstrates, uGLAD can be successfully used as a tool for generating insight into growth dynamics of organisms in a digester and (hopefully) into domain structure in many other applications. Our algorithm works with any input, including: ASVs filtered by frequency, ASVs rolled up to higher taxonomy levels (species, genus, family), ASVs abundance normalized in various ways [2]. We use networkx package to visualize the graphs, presenting positive correlations in green and negative in red, with edge weights corresponding to the strength of the correlation.…”
Section: Discussionmentioning
confidence: 99%
“…For ease of interpretability, we leverage the taxonomic tree information rather than phylogeny in our aggregation framework. To investigate potential human host-microbiome interactions, we re-analyze two human gut datasets, one cohort of HIV patients (Gut (HIV)), available in (Rivera-Pinto et al, 2018), comprising p = 539 OTUs and n = 152 samples, and the other a subset of the American Gut Project data (Gut (AGP)) (McDonald, 2018), provided in (Badri et al, 2020), comprising p = 1387 OTUs present in at least 10% of the n = 6266 samples. To study niche partitioning in terrestrial ecosystems, we use the Central Park soil dataset (Ramirez et al, 2014), as provided by Washburne et al (2017), which consists of p = 3379 OTUs and n = 580 samples with a wide range of soil property measurements.…”
Section: Data Collectionmentioning
confidence: 99%