2018
DOI: 10.1002/sim.7896
|View full text |Cite
|
Sign up to set email alerts
|

Analyzing differences between microbiome communities using mixture distributions

Abstract: In this paper, we present a method to assess differences between microbiome communities that effectively models sparse count data and accounts for presence-absence bias frequently encountered when zeros are present. We assume that the observed data for each operational taxonomic unit is Poisson generated with the rate for each sample originating from an underlying rate distribution. We propose to model this distribution using a mixture model, specifying the components based on the posterior rate distribution o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 11 publications
(21 citation statements)
references
References 34 publications
0
21
0
Order By: Relevance
“…The main steps of the model include mixture distribution specification and parameter estimation for modelling observed data, calculation of conditional distributions for each sample, and calculating distances between samples and cluster centres to use in distance-based classification methods. The mixture model and conditional distribution estimation are described in Shestopaloff et al [21]. It is proposed to model the underlying population rate structure of the observed count data using a mixture distribution with Poisson-Gamma components, then conditioning on observed sample counts and resolution to obtain sample-specific distributions.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The main steps of the model include mixture distribution specification and parameter estimation for modelling observed data, calculation of conditional distributions for each sample, and calculating distances between samples and cluster centres to use in distance-based classification methods. The mixture model and conditional distribution estimation are described in Shestopaloff et al [21]. It is proposed to model the underlying population rate structure of the observed count data using a mixture distribution with Poisson-Gamma components, then conditioning on observed sample counts and resolution to obtain sample-specific distributions.…”
Section: Methodsmentioning
confidence: 99%
“…We estimate the joint mixture model using a nonparametric bootstrap algorithm. As stated in Shestopaloff et al [21], we can obtain the weight v(l) of each candidate model, which is the proportion of times each model is selected as optimal relative to the observed data, and calculate the weights for the joint mixture distribution. Let w l be the estimated weights for each candidate model, F l , with zeros assigned to the weights of components not included in a specific model, then the weights of the joint model are…”
Section: Weighted Mixture Distributionmentioning
confidence: 99%
See 1 more Smart Citation
“…To deal with the unique characteristics of microbiome data—sparsity with abundant zeros, we incorporate a mixture model proposed by Shestopaloff [ 40 ] to attain the beta diversity measures for partition. The mixture model focuses on the distribution of a single OTU across a population which can address the problem of sparsity between samples.…”
Section: Materials and Methodsmentioning
confidence: 99%
“…The -C CDF norms can be computed by where is a matrix such that represents the two components in the mixture model. See details of the derivation in [ 40 ].…”
Section: Materials and Methodsmentioning
confidence: 99%