2006
DOI: 10.1186/1471-2164-7-142
|View full text |Cite
|
Sign up to set email alerts
|

Centering, scaling, and transformations: improving the biological information content of metabolomics data

Abstract: Background: Extracting relevant biological information from large data sets is a major challenge in functional genomics research. Different aspects of the data hamper their biological interpretation. For instance, 5000-fold differences in concentration for different metabolites are present in a metabolomics data set, while these differences are not proportional to the biological relevance of these metabolites. However, data analysis methods are not able to make this distinction. Data pretreatment methods can c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

6
1,394
0
14

Year Published

2010
2010
2019
2019

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 1,997 publications
(1,414 citation statements)
references
References 28 publications
6
1,394
0
14
Order By: Relevance
“…Peak areas were mean centered, level scaled to their means ( Van den Berg et al, 2006) and used to build a Partial Least Square Discriminant Analysis Model (PLS-DA), using the Classification Toolbox for MATLAB (Ballabio and Consonni, 2013). PLS-DA models (Positive ions/Negative ions) were built using the following parameters: two components, Bayes assignment, six cross validation (CV) groups in contiguous blocks (i.e., LeaveOne-Out validation as six samples were used).…”
Section: Untargeted Lipid Analysismentioning
confidence: 99%
“…Peak areas were mean centered, level scaled to their means ( Van den Berg et al, 2006) and used to build a Partial Least Square Discriminant Analysis Model (PLS-DA), using the Classification Toolbox for MATLAB (Ballabio and Consonni, 2013). PLS-DA models (Positive ions/Negative ions) were built using the following parameters: two components, Bayes assignment, six cross validation (CV) groups in contiguous blocks (i.e., LeaveOne-Out validation as six samples were used).…”
Section: Untargeted Lipid Analysismentioning
confidence: 99%
“…31 Commonly used methods of normalization in mass spectrometry and their influence on the identification of biomarkers in a non-image setting have been described elsewhere. [32][33][34][35] In the present study, seven methods of normalizing the spectral intensities of each pixel were evaluated. The factor by which the peak intensities of each individual pixel would be divided for each of these normalization methods is shown in Figure 3, where "informative peaks" refers to peaks that remained after variable selection with approaches A and B. Histogram matching is the only method that does not involve the division by a scalar normalization factor, and could therefore not be displayed.…”
Section: Normalizationmentioning
confidence: 99%
“…Centering and scaling: centering and scaling pretreatments were applied to the data set by means of algorithms coded by authors (to more information about scaling preprocessing, lector is remitted to van der Berg et al 12 and literature cited herein).…”
Section: Resultsmentioning
confidence: 99%
“…Several preprocessing methods, including mean centering, autoscaling, range scaling, pareto scaling, vast scaling and level scaling, were applied to the dataset according to literature 12,13 . Before the scaling and centering processes a peak alignment was tested.…”
Section: Multivariate Analysismentioning
confidence: 99%