2008
DOI: 10.1007/s00216-007-1790-1
|View full text |Cite
|
Sign up to set email alerts
|

Cross-validation of component models: A critical look at current methods

Abstract: In regression, cross-validation is an effective and popular approach that is used to decide, for example, the number of underlying features, and to estimate the average prediction error. The basic principle of cross-validation is to leave out part of the data, build a model, and then predict the left-out samples. While such an approach can also be envisioned for component models such as principal component analysis (PCA), most current implementations do not comply with the essential requirement that the predic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
250
0
8

Year Published

2010
2010
2017
2017

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 304 publications
(258 citation statements)
references
References 12 publications
0
250
0
8
Order By: Relevance
“…Thus, if the decrease of PRESS when adding the a-th PC is lower than the threshold, the PC is discarded and the model selected contains a − 1 PCs. Also, the curve can be corrected with the degrees of freedom consumed [3].…”
Section: Cross-validation In Pcamentioning
confidence: 99%
See 3 more Smart Citations
“…Thus, if the decrease of PRESS when adding the a-th PC is lower than the threshold, the PC is discarded and the model selected contains a − 1 PCs. Also, the curve can be corrected with the degrees of freedom consumed [3].…”
Section: Cross-validation In Pcamentioning
confidence: 99%
“…The rkf has been criticized because the PCA estimates are computed using the actual values as input [3]. Since there is not independence between actual values and estimates, the modelling error computed in rkf cannot be considered purely prediction error.…”
Section: Cross-validation In Pcamentioning
confidence: 99%
See 2 more Smart Citations
“…For this case, numerous PLS components are usually needed and the x-variables are usually not scaled to equal variance. Typically some kind of cross validation is used to optimize the predictive power of the regression model [17]. Discussion of which validation procedure to use is outside the scope of the present work.…”
Section: Model Interpretation For Non-orthogonal X-variablesmentioning
confidence: 99%