2011
DOI: 10.1007/s11306-011-0330-3
|View full text |Cite
|
Sign up to set email alerts
|

Double-check: validation of diagnostic statistics for PLS-DA models in metabolomics studies

Abstract: Partial Least Squares-Discriminant Analysis (PLS-DA) is a PLS regression method with a special binary ‘dummy’ y-variable and it is commonly used for classification purposes and biomarker selection in metabolomics studies. Several statistical approaches are currently in use to validate outcomes of PLS-DA analyses e.g. double cross validation procedures or permutation testing. However, there is a great inconsistency in the optimization and the assessment of performance of PLS-DA models due to many different diag… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

11
520
0
4

Year Published

2014
2014
2022
2022

Publication Types

Select...
8
2

Relationship

1
9

Authors

Journals

citations
Cited by 691 publications
(535 citation statements)
references
References 31 publications
11
520
0
4
Order By: Relevance
“…The accuracy for classification was assessed by means of a double cross-validation scheme. 22,23 The original data set was split into a training set (80%) and a test set (20%) randomly before any step of statistical analysis. The number of OPLS components (3-40 components) was chosen on the basis of a five-fold cross-validation performed on the training set only, and the best model was used to predict the samples in the test set.…”
Section: Resultsmentioning
confidence: 99%
“…The accuracy for classification was assessed by means of a double cross-validation scheme. 22,23 The original data set was split into a training set (80%) and a test set (20%) randomly before any step of statistical analysis. The number of OPLS components (3-40 components) was chosen on the basis of a five-fold cross-validation performed on the training set only, and the best model was used to predict the samples in the test set.…”
Section: Resultsmentioning
confidence: 99%
“…However, no clear separation was obtained to differentiate the two groups of the oil palms, therefore, further separation of the groups were performed by supervised analysis of partial least squares-discriminant analysis (PLS-DA) and orthogonal partial least squares-discriminant analysis (OPLS-DA) model. PLS-DA regression, also known as projection on latent structures is a method practically used for classification a set of group samples and selection of biomarker in metabolomics studies (Szymańska et al, 2012). The model is built between dependent variables (Y) and independent variables (X) (i.e.…”
Section: Resultsmentioning
confidence: 99%
“…These groups of results were evaluated statistically and the resulting models showed values in line with quality parameters R 2 and Q 2 (explained variance of approximately 99% and a predicted variance above 50%) 53 . T-test analysis with p-values and data modelling using the PLS (Principal Least Squares) progression were carried out 54,55 . A large number of signals could be studied in the discrimination of classes considering the Variable-Importance-in-Projection-(VIP) which was set at a minimum value of 2.…”
Section: Resultsmentioning
confidence: 99%