2002
DOI: 10.1002/cem.751
|View full text |Cite
|
Sign up to set email alerts
|

Fast principal component analysis of large data sets based on information extraction

Abstract: Principal component analysis (PCA) and principal component regression (PCR) are routinely used for calibration of measurement devices and for data evaluation. However, their use is hindered in some applications, e.g. hyperspectral imaging, by excessive data sets that imply unacceptable calculation time. This paper discusses a fast PCA achieved by a combination of data compression based on a wavelet transformation and a spectrum selection method prior to the PCA itself. The spectrum selection step can also be a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
15
0

Year Published

2003
2003
2007
2007

Publication Types

Select...
6

Relationship

2
4

Authors

Journals

citations
Cited by 24 publications
(15 citation statements)
references
References 18 publications
0
15
0
Order By: Relevance
“…Usually, calibration spectra x cal 1 ::: x cal K are written in the rows of a calibration matrix X cal . If N > K, however, it is advantageous to write them in columns of X cal , since the computation effort can be decreased considerably [10,11]. Furthermore, considering the principal components (PCs) as columns of an overdetermined equation system is more intuitive, since this is the usual form in multivariate regression analysis.…”
Section: Introductionmentioning
confidence: 99%
“…Usually, calibration spectra x cal 1 ::: x cal K are written in the rows of a calibration matrix X cal . If N > K, however, it is advantageous to write them in columns of X cal , since the computation effort can be decreased considerably [10,11]. Furthermore, considering the principal components (PCs) as columns of an overdetermined equation system is more intuitive, since this is the usual form in multivariate regression analysis.…”
Section: Introductionmentioning
confidence: 99%
“…For the lowest compression, however, acceleration factors are in the range <1-2, that is in two cases the compression-based approach was slightly slower the conventional PARAFAC. This is the first case in several applications of wavelet compression [29][30][31]37,38] where not enough data reduction was achieved to over-compensate computation efforts for the compression.…”
Section: Resultsmentioning
confidence: 98%
“…For example, 1D WTs have been utilized prior to principal component analysis/regression (PCA/PCR) in order to accelerate calculations [29][30][31]. Although (inverse) WTs add computational expense, which is linear with respect to the amount of data [16], smaller data sets allow for decreasing PCA computation times in the second and third order [32].…”
Section: Introductionmentioning
confidence: 99%
“…individually, thus independently optimizing spectral data evaluation. For the first estimate of 2 the proposed selection algorithm is based on considering the sums of squares (SS) [19] of the reduced model (3) and the full model (4). SS is determined from that part of m meas which can be acquired by the PCs:…”
Section: Theorymentioning
confidence: 99%
“…Usually, calibration spectra are written in the rows of a calibration matrix M cal . However, as shown by Vogt and Tacke [3,4], it is advantageous to write calibration spectra in the columns of M cal if N > K, since the computational effort can be considerably decreased. Furthermore, considering PCs as columns of an overdetermined equation system is more intuitive, following the usual form applied in multivariate regression analysis.…”
Section: Introductionmentioning
confidence: 98%