2020
DOI: 10.1002/cem.3226

Comparison of variable selection methods in partial least squares regression

Abstract: Through the remarkable progress in technology, it is getting easier and easier to generate vast amounts of variables from a given sample. The selection of variables is imperative for data reduction and for understanding the modeled relationship. Partial least squares (PLS) regression is among the modeling approaches that address high throughput data. A considerable list of variable selection methods has been introduced in PLS. Most of these methods have been reviewed in a recently conducted study. Motivated by…

Cited by 149 publications (77 citation statements)
References 45 publications
“…Mean-squared error was determined using a 5-fold cross validation, and was used to select the number of vector components retained in each PLSR model [26]. Pearson's linear correlation coefficient was used to compare drift-subtracted CVs to those obtained using the conventional background subtraction approach. All statistical analyses and graphical depiction of data were carried out using GraphPad Prism 6 (GraphPad Software, Inc., La Jolla, CA) or MATLAB R2018a.…”
Section: Data Processing and Analysis (mentioning)
confidence: 99%
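
The component-selection step quoted above (5-fold cross-validated mean-squared error used to choose how many PLS components to keep) can be sketched roughly as follows. This is a minimal illustration in pure Python, not the citing authors' MATLAB workflow: the function names (`fit_pls1`, `predict_pls1`, `cv_mse`), the single-response NIPALS variant, the interleaved fold assignment, and the synthetic data are all assumptions made for the example.

```python
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def fit_pls1(X, y, n_components):
    """Fit single-response PLS by NIPALS; returns means and (w, p, q) per component."""
    n, m = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(m)]
    y_mean = sum(y) / n
    E = [[row[j] - x_mean[j] for j in range(m)] for row in X]  # centered predictors
    f = [v - y_mean for v in y]                                # centered response
    comps = []
    for _ in range(n_components):
        w = [dot([E[i][j] for i in range(n)], f) for j in range(m)]  # weights ~ X'y
        norm = dot(w, w) ** 0.5
        if norm == 0.0:
            break
        w = [v / norm for v in w]
        t = [dot(E[i], w) for i in range(n)]                   # scores
        tt = dot(t, t)
        if tt == 0.0:
            break
        p = [dot([E[i][j] for i in range(n)], t) / tt for j in range(m)]  # X loadings
        q = dot(f, t) / tt                                     # y loading
        E = [[E[i][j] - t[i] * p[j] for j in range(m)] for i in range(n)]  # deflate X
        f = [f[i] - q * t[i] for i in range(n)]                # deflate y
        comps.append((w, p, q))
    return x_mean, y_mean, comps

def predict_pls1(model, x, n_components):
    """Predict one sample using only the first n_components components."""
    x_mean, y_mean, comps = model
    e = [a - b for a, b in zip(x, x_mean)]
    y_hat = y_mean
    for w, p, q in comps[:n_components]:
        t = dot(e, w)
        y_hat += q * t
        e = [a - t * b for a, b in zip(e, p)]
    return y_hat

def cv_mse(X, y, max_components, k=5):
    """k-fold cross-validated MSE for 1..max_components components."""
    n = len(X)
    folds = [list(range(i, n, k)) for i in range(k)]  # interleaved folds (assumption)
    sse = [0.0] * max_components
    for test_idx in folds:
        held_out = set(test_idx)
        train = [i for i in range(n) if i not in held_out]
        model = fit_pls1([X[i] for i in train], [y[i] for i in train], max_components)
        for i in test_idx:
            for a in range(1, max_components + 1):
                err = y[i] - predict_pls1(model, X[i], a)
                sse[a - 1] += err * err
    return [s / n for s in sse]

# Synthetic data (illustrative only): y depends on the first two of four predictors.
random.seed(0)
X = [[random.gauss(0, 1) for _ in range(4)] for _ in range(40)]
y = [2.0 * r[0] - 1.0 * r[1] + 0.1 * random.gauss(0, 1) for r in X]

mse = cv_mse(X, y, max_components=4)
best = min(range(len(mse)), key=mse.__getitem__) + 1  # component count with lowest CV MSE
print("CV MSE per component count:", mse)
print("selected number of components:", best)
```

Because NIPALS builds components sequentially, the sketch fits each training fold once with the maximum number of components and reuses the leading components for the smaller models. Picking the strict minimum of the CV curve is only one possible rule; more parsimonious criteria are also common.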
“…By contrast, PLSR is a supervised dimensionality reduction method that projects both predictor and response variables to a new vector space to determine the PCs that maximize the covariance of projected structures [26,42]. As such, PLSR generally describes training data more efficiently with fewer PCs (than PCR), and output prediction is often more robust [43,44]. The DW-PLSR model is trained using data collected with the sWF and the lWF as the predictor and response, respectively.…”
Section: The Double-waveform Partial-least-squares Regression Model (mentioning)
confidence: 99%
“…Numerical methods, such as sRatio and VIP select variables by the amount of a specific value [23]. With sRatio, the ratio of explained variance to residual variance was calculated for each variable.…”
Section: Multivariate Data Analysis (mentioning)
confidence: 99%
“…Numerical methods, such as sRatio and VIP select variables by the amount of a specific value [23]. With sRatio, the ratio of explained variance to residual variance is calculated for each variable.…”
Section: Multivariate Data Analysis (mentioning)
confidence: 99%
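
The sRatio calculation mentioned in the two statements above (explained variance divided by residual variance, per variable) can be illustrated with a short sketch. It follows the usual target-projection formulation of the selectivity ratio, with one simplifying assumption made purely for brevity: the regression vector is taken from a one-component PLS fit, so its direction is proportional to X'y. The function name `selectivity_ratio` and the synthetic data are hypothetical and do not come from the cited papers.

```python
import random

def selectivity_ratio(X, y):
    """Per-variable ratio of explained to residual variance after target projection."""
    n, m = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(m)]
    y_mean = sum(y) / n
    Xc = [[row[j] - x_mean[j] for j in range(m)] for row in X]
    yc = [v - y_mean for v in y]

    # Assumption: regression vector direction from a one-component PLS fit (b ~ X'y).
    b = [sum(Xc[i][j] * yc[i] for i in range(n)) for j in range(m)]
    norm = sum(v * v for v in b) ** 0.5
    w = [v / norm for v in b]

    # Target projection: scores along the regression vector, then loadings.
    t = [sum(Xc[i][j] * w[j] for j in range(m)) for i in range(n)]
    tt = sum(v * v for v in t)
    p = [sum(Xc[i][j] * t[i] for i in range(n)) / tt for j in range(m)]

    ratios = []
    for j in range(m):
        explained = sum((t[i] * p[j]) ** 2 for i in range(n))
        residual = sum((Xc[i][j] - t[i] * p[j]) ** 2 for i in range(n))
        ratios.append(explained / residual if residual > 0 else float("inf"))
    return ratios

# Synthetic data (illustrative only): only the first variable drives the response.
random.seed(1)
X = [[random.gauss(0, 1) for _ in range(3)] for _ in range(50)]
y = [3.0 * r[0] + 0.2 * random.gauss(0, 1) for r in X]

sr = selectivity_ratio(X, y)
print("selectivity ratios:", sr)
```

Variables would then be retained when their ratio exceeds some threshold. How that threshold is chosen (for example, via an F-test) is left out here, since the quoted statements do not specify it.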