2021
DOI: 10.1101/2021.05.15.444300
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Compositional data analysis of microbiome and any-omics datasets: a revalidation of the additive logratio transformation

Abstract: Background: Microbiome and omics datasets are, by their intrinsic biological nature, of high dimensionality, characterized by counts of large numbers of components (microbial genes, operational taxonomic units, RNA transcripts, etc...). These data are generally regarded as compositional since the total number of counts identified within a sample are irrelevant. The central concept in compositional data analysis is the logratio transformation, the simplest being the additive logratios with respect to a fixed… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2

Relationship

3
2

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 35 publications
0
3
0
Order By: Relevance
“…the alr interpretation reducing it to the numerator part [137] (Figure S2). The abundance of MG ribulose-phosphate 3-epimerase [EC:5.1.3.1] (rpe, KEGG code K01783) involved in pentose phosphate pathway (average relative abundance of 0.03% in our population) was selected as denominator with a high Procrustes correlation equal to 0.9974 and a small log-ratio variance equal to 0.0379 (coefficient of variation =5.08%, five points summary: min=−4.516, 1st quartile=−3.555, median=−3.466, mean =−3.502, 3rd quartile −3.405, max.…”
Section: Metagenomics Datamentioning
confidence: 99%
“…the alr interpretation reducing it to the numerator part [137] (Figure S2). The abundance of MG ribulose-phosphate 3-epimerase [EC:5.1.3.1] (rpe, KEGG code K01783) involved in pentose phosphate pathway (average relative abundance of 0.03% in our population) was selected as denominator with a high Procrustes correlation equal to 0.9974 and a small log-ratio variance equal to 0.0379 (coefficient of variation =5.08%, five points summary: min=−4.516, 1st quartile=−3.555, median=−3.466, mean =−3.502, 3rd quartile −3.405, max.…”
Section: Metagenomics Datamentioning
confidence: 99%
“…However, there are alternative transformations, not using ratios but admitting zero values, which are not strictly coherent, but can be close enough to coherence and isometry to be satisfactory in practice. For example, the chiPower transformation (Greenacre, 2023), using a power transformation of the compositional data combined with a standardization used in the chi-square distance of correspondence analysis, does not need zero replacement. Essentially, dealing with zeros can be problematic, particularly when there is a limit of detection, e.g.…”
Section: Centred Logratio (Clr)mentioning
confidence: 99%
“…In the current application, where no zero values were recorded for the bronzes but have been inserted into the dataset for values assigned as being below the detection limit of the instruments for the element, we make a power transformation of the data, with the chi-square standardization inherent in correspondence analysis, leading to a geometry of the samples which is compared to the target. This transformation has been called the chiPower transformation (Greenacre, 2023). As in Section 5.2.4, the Procrustes correlation is a convenenient measure of proximity between the two geometries.…”
Section: S2 Analysing Compositional Data Without Zero Replacement: Un...mentioning
confidence: 99%