2022
DOI: 10.1093/bioinformatics/btac059
|View full text |Cite
|
Sign up to set email alerts
|

metaboprep: an R package for preanalysis data description and processing

Abstract: Motivation Metabolomics is an increasingly common part of health research and there is need for pre-analytical data processing. Researchers typically need to characterise the data and to exclude errors within the context of the intended analysis. While some pre-processing steps are common, there is currently a lack of standardization and reporting transparency for these procedures. Results Here we introduce metaboprep, a stan… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

4
2

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 31 publications
0
8
0
Order By: Relevance
“…We processed the raw (original scale) data received from Metabolon ( N = 760 samples) in preparation for statistical analysis using a prerelease version of metaboprep (14). Data were filtered based on a series of quality metrics.…”
Section: Methodsmentioning
confidence: 99%
“…We processed the raw (original scale) data received from Metabolon ( N = 760 samples) in preparation for statistical analysis using a prerelease version of metaboprep (14). Data were filtered based on a series of quality metrics.…”
Section: Methodsmentioning
confidence: 99%
“…We applied further quality control (QC) steps using the metaboprep R package: samples and features were excluded based on extreme missingness (80%) before missingness was recalculated and exclusions made using a 20% threshold, additional exclusions were based on sample total sum abundance (>5 SD from the mean) and PCA of independent features (>5 SD from the mean). 30 Protein measurements below the limit of detection were retained with their recorded value with the caveat that data below the limit of detection have a higher risk to be in the non-linear phase of the S-curve of the NPX unit which may bias estimates. All protein measures were standardised and normalised prior to analyses using rank-based inverse normal transformation (RNT).…”
Section: Methodsmentioning
confidence: 99%
“…Data quality checks were carried out locally using a pre-release version of the R package metaboprep [ 18 ] with samples and features excluded from subsequent statistical analysis based on a pre-defined set of quality control (QC) metrics. Full details of the procedures are given in ESM Methods and data summaries produced are included within the associated GitHub repository ( https://github.com/lauracorbin/metabolomics_of_direct ).…”
Section: Methodsmentioning
confidence: 99%