2010
DOI: 10.1016/j.jmva.2010.04.006
|View full text |Cite
|
Sign up to set email alerts
|

Effective PCA for high-dimension, low-sample-size data with singular value decomposition of cross data matrix

Abstract: a b s t r a c tIn this paper, we propose a new methodology to deal with PCA in high-dimension, low-sample-size (HDLSS) data situations. We give an idea of estimating eigenvalues via singular values of a cross data matrix. We provide consistency properties of the eigenvalue estimation as well as its limiting distribution when the dimension d and the sample size n both grow to infinity in such a way that n is much lower than d. We apply the new methodology to estimating PC directions and PC scores in HDLSS data … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
52
0

Year Published

2013
2013
2023
2023

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 62 publications
(54 citation statements)
references
References 11 publications
1
52
0
Order By: Relevance
“…In order to study the performance of the two-stage procedure given by (12) and (13), we used computer simulations. Our goal was to construct a 95% givenbandwidth confidence region, R N .…”
Section: Simulationmentioning
confidence: 99%
See 1 more Smart Citation
“…In order to study the performance of the two-stage procedure given by (12) and (13), we used computer simulations. Our goal was to construct a 95% givenbandwidth confidence region, R N .…”
Section: Simulationmentioning
confidence: 99%
“…We divided each type into two groups with respect to the relapse as follows: (a) π 1 : B-cell with the relapse (n 1 = 50 samples); (b) π 2 : B-cell without the relapse (n 2 = 26 samples); (c) π 3 : T-cell with the relapse (n 3 = 15 samples); and (d) π 4 : T-cell without the relapse (n 4 = 9 samples). Table 2 Required sample size and coverage probability given by (12) and (13) …”
Section: Examplementioning
confidence: 99%
“…Actually, one cannot handle the noise space by S D for high-dimension, non-Gaussian data. In order to handle the two-sample problem for high-dimensional non-Gaussian data, we shall consider applying the CDM method given by Yata and Aoshima (2010). …”
Section: Noise Space For High-dimensional Non-gaussian Datamentioning
confidence: 99%
“…The cross-data-matrix (CDM) methodology is high-dimensional Principal Component Analysis developed by Yata and Aoshima (2010). Again, throughout this section, we omit the population index to avoid confusion.…”
Section: Bias-corrected Cdm Estimatormentioning
confidence: 99%
See 1 more Smart Citation