2018
DOI: 10.1080/01621459.2018.1442340
|View full text |Cite
|
Sign up to set email alerts
|

Large Covariance Estimation for Compositional Data Via Composition-Adjusted Thresholding

Abstract: High-dimensional compositional data arise naturally in many applications such as metagenomic data analysis. The observed data lie in a high-dimensional simplex, and conventional statistical methods often fail to produce sensible results due to the unit-sum constraint. In this article, we address the problem of covariance estimation for high-dimensional compositional data, and introduce a composition-adjusted thresholding (COAT) method under the assumption that the basis covariance matrix is sparse. Our method … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

6
71
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 49 publications
(78 citation statements)
references
References 30 publications
6
71
1
Order By: Relevance
“…One of the key challenges in working with compositional data is the presence of unit-sum constraint. For correlation estimation, a common approach (see, e.g., Aitchison (1983); Cao et al (2018); Kurtz et al (2015)) is to first apply the centered log-ratio transform (clr) to the compositional vector of each sample x i ∈ S p…”
Section: Extensions To Compositional Datamentioning
confidence: 99%
See 2 more Smart Citations
“…One of the key challenges in working with compositional data is the presence of unit-sum constraint. For correlation estimation, a common approach (see, e.g., Aitchison (1983); Cao et al (2018); Kurtz et al (2015)) is to first apply the centered log-ratio transform (clr) to the compositional vector of each sample x i ∈ S p…”
Section: Extensions To Compositional Datamentioning
confidence: 99%
“…Microbiome community-level analysis tasks, such as quantifying community composition shifts across conditions or associating high-dimensional species compositions and their taxonomic profiles to each other and to environmental or host-associated covariates, require statistical estimation procedures that can handle the restrictive nature of such sparse proportional (or compositional) microbiome datasets (Li, 2015). Important examples include differential abundance techniques (Mandal et al, 2015;McMurdie and Holmes, 2014), proportionality estimation (Quinn et al, 2017), regression models with compositional covariates (Holmes et al, 2012;Lin et al, 2014), composition-adjusted correlation estimation techniques (Cao et al, 2018;Friedman and Alm, 2012), and sparse graphical models for microbial association networks (Kurtz et al, 2015;Tipton et al, 2018).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…A popular linear measure of association is covariance or, when data are standardized, Pearson correlation. In the microbiome context, covariance estimators are available both for compositional data [20,8] and for absolute microbial abundance data [54]. To account for transitive correlations, estimators of the inverse covariance (or precision) matrix are also available [30,16].…”
Section: A Latent Variable Graphical Model For Microbial Associationsmentioning
confidence: 99%
“…We first investigated the ability of the latent graphical model (SE-slr) to identify direct associations that are obfuscated by purely compositional effects. We compared its performance to SE-glasso and the compositionadjusted thresholding (COAT) estimator [8] under two different scenarios: sparse networks generated (i) from synthetic absolute abundance data and (ii) measured quantitative microbiome profiling (QMP) data [51].…”
Section: Disentangling Compositional Effectsmentioning
confidence: 99%