2017
DOI: 10.1002/cem.2887
|View full text |Cite
|
Sign up to set email alerts
|

Weight randomization test for the selection of the number of components in PLS models

Abstract: The selection of the optimal number of components remains a difficult but essential task in partial least squares (PLS). Randomization tests have the advantage of being automatic and they make use of the entire dataset, in contrary with the widely used cross‐validation approaches. Partial least squares modeling may include component(s) with a large amount of irrelevant data variation, and this might affect the model, depending on the assigned y‐loading (which is the regression coefficient in the latent domain)… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
8
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(8 citation statements)
references
References 57 publications
0
8
0
Order By: Relevance
“…PLS was performed in MATLAB using the plsregress function. The number of the model components for each region was chosen using a weight randomization test [ 63 ]. Characteristic features that discriminated between symptomatic and asymptomatic plaques were defined as the lipid ions with a variable importance in the projection (VIP) greater than one [ 64 ] with a loading on the first PLS axis exceeding the 90th percentile of the positive (increased in symptomatic) or negative (increased in asymptomatic) values.…”
Section: Methodsmentioning
confidence: 99%
“…PLS was performed in MATLAB using the plsregress function. The number of the model components for each region was chosen using a weight randomization test [ 63 ]. Characteristic features that discriminated between symptomatic and asymptomatic plaques were defined as the lipid ions with a variable importance in the projection (VIP) greater than one [ 64 ] with a loading on the first PLS axis exceeding the 90th percentile of the positive (increased in symptomatic) or negative (increased in asymptomatic) values.…”
Section: Methodsmentioning
confidence: 99%
“…Researchers have utilised partial least squares discriminant analysis (PLS‐DA) for Raman spectroscopy modelling for many years with great success. [ 16–18 ] In fluorescence spectroscopy, application of PLS‐DA was used to successfully authenticate beef muscles and shows the potential for PLS‐DA to be used in subcutaneous beef fat. Beattie et al [ 19 ] have successfully utilised Raman spectroscopy in combination with PLS‐DA modelling to classify adipose tissue of various species including beef, but no study has utilised these methods to classify the production system of cattle.…”
Section: Introductionmentioning
confidence: 99%
“…For multivariate analysis, the duplicate measurements were used directly, because multivariate methods may distinguish between variance that explains the response and variance that does not explain the response, e.g., instrumental variability. Multivariate analysis included the following: OPLSDA [ 29 , 30 ] in combination with VIP [ 6 ], weight randomization test for partial least squares (WRT-PLS) [ 31 ] in combination with sMC [ 8 ], CARS-PLS-DA[ 11 ] with two latent variables, and sparse PLS-DA with one latent variable [ 10 ]. Double cross-validation was used to validate the results [ 32 ].…”
Section: Methodsmentioning
confidence: 99%