2001
DOI: 10.1002/cem.646
|View full text |Cite
|
Sign up to set email alerts
|

Selecting significant factors by the noise addition method in principal component analysis

Abstract: SUMMARYThe noise addition method (NAM) is presented as a tool for determining the number of significant factors in a data set. The NAM is compared to residual standard deviation (RSD), the factor indicator function (IND), chisquared ( 2 ) and cross-validation (CV) for establishing the number of significant factors in three data sets. The comparison and validation of the NAM are performed through Monte Carlo simulations with noise distributions of varying standard deviation, HPLC/UV-vis chromatographs of a mixt… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
1
0

Year Published

2004
2004
2023
2023

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 14 publications
(1 citation statement)
references
References 14 publications
0
1
0
Order By: Relevance
“…This results in “splitting” of an analyte signal across multiple factors and possible contamination of analyte signal with extraneous noise. To put this into context, the problem we address is the classical chemometrics problem of determining the most parsimonious model, i.e., the appropriate number of factors between underfitting and overfitting. ,, It is important to remember, however, that PARAFAC models, unlike PCA-based models, are not nested, e.g., a one- or two-factor model cannot be extracted from a three-factor model. That is to say, a model with N factors can generally be said to have N unique components, none of which are exactly the same as any component in a N − x factor model (for N ≠ 1), where x is any positive integer from 1 to N − 1.…”
mentioning
confidence: 99%
“…This results in “splitting” of an analyte signal across multiple factors and possible contamination of analyte signal with extraneous noise. To put this into context, the problem we address is the classical chemometrics problem of determining the most parsimonious model, i.e., the appropriate number of factors between underfitting and overfitting. ,, It is important to remember, however, that PARAFAC models, unlike PCA-based models, are not nested, e.g., a one- or two-factor model cannot be extracted from a three-factor model. That is to say, a model with N factors can generally be said to have N unique components, none of which are exactly the same as any component in a N − x factor model (for N ≠ 1), where x is any positive integer from 1 to N − 1.…”
mentioning
confidence: 99%