2020
DOI: 10.1186/s13059-020-02104-1
|View full text |Cite
|
Sign up to set email alerts
|

Assessment of statistical methods from single cell, bulk RNA-seq, and metagenomics applied to microbiome data

Abstract: Background: The correct identification of differentially abundant microbial taxa between experimental conditions is a methodological and computational challenge. Recent work has produced methods to deal with the high sparsity and compositionality characteristic of microbiome data, but independent benchmarks comparing these to alternatives developed for RNA-seq data analysis are lacking. Results: We compare methods developed for single-cell and bulk RNA-seq, and specifically for microbiome data, in terms of sui… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
80
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 91 publications
(84 citation statements)
references
References 58 publications
3
80
1
Order By: Relevance
“…In particular, count-based models, due to their strong parametric assumptions on the distributions or parametric specifications of the mean-variance dependency, tend to have inflated FDR when the assumptions are violated. In sharp contrast to previous claims, however, compositionality-corrected methods such as ANCOM 14,22 as well as specialized normalization and transformation methods such as CLR 46 did not improve performance over non-compositional approaches 8,47 , consistent with recent findings that compositional methods may not always outperform non-compositional methods 32 . Importantly, these conclusions hold regardless of the nature of the modeling paradigm (i.e.…”
Section: Discussionsupporting
confidence: 74%
See 2 more Smart Citations
“…In particular, count-based models, due to their strong parametric assumptions on the distributions or parametric specifications of the mean-variance dependency, tend to have inflated FDR when the assumptions are violated. In sharp contrast to previous claims, however, compositionality-corrected methods such as ANCOM 14,22 as well as specialized normalization and transformation methods such as CLR 46 did not improve performance over non-compositional approaches 8,47 , consistent with recent findings that compositional methods may not always outperform non-compositional methods 32 . Importantly, these conclusions hold regardless of the nature of the modeling paradigm (i.e.…”
Section: Discussionsupporting
confidence: 74%
“…Notably, previous benchmarking in this area has only focused on differential abundance testing without the simultaneous consideration of multiple covariates and repeated measures 7-9 . Additionally, with the exception of Hawinkel et al 7 , these benchmarking efforts lacked important considerations to the extent that they either (i) did not consider FDR as a metric of evaluation 9,31,32 or (ii) misreported false positive rate as FDR 8 ( Methods ). While a majority of these studies made a final recommendation based on the traditional AUC metric or a combination of sensitivity and specificity, we argue that without considering the FDR-controlling behavior of a method, the AUC values alone are too optimistic to draw any meaningful conclusions about its practical utility.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…This required careful modeling of the sequencing counts which often do not follow normal distributions. Here we chose negative binomial models as they model the high prevalence of zero read counts and have been shown to represent amplicon sequencing data well ( 42 ). Consequently, associations between the microbiome and clinical variables were identified by a robust normalization and testing strategy based on DESeq2 (see Materials and Methods).…”
Section: Resultsmentioning
confidence: 99%
“…Using "reference frames", while still a new and novel approach to evaluating microbial associations, represents an advancement in techniques that shows promise in handling compositional data and may improve the conclusions we draw. While the "reference frame" approach does not designate p-value associations for differential abundance, it shows "greater ability to identify correct taxa" [38,39]. Therefore, it was interesting to apply Songbird to a low-biomass dataset such as the skin microbiota as the test case in the original article had explored a similar low-biomass environment [23].…”
Section: Reference Framesmentioning
confidence: 99%