2021
DOI: 10.5705/ss.202019.0052
|View full text |Cite
|
Sign up to set email alerts
|

Asymptotics of eigenstructure of sample correlation matrices for high-dimensional spiked models

Abstract: Sample correlation matrices are widely used, but surprisingly little is known about their asymptotic spectral properties for high-dimensional data beyond the case of "null models", for which the data is assumed to have independent coordinates. In the class of spiked models, we apply random matrix theory to derive asymptotic first-order and distributional results for both the leading eigenvalues and eigenvectors of sample correlation matrices, assuming a high dimensional regime in which the ratio p/n, of number… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 21 publications
(21 citation statements)
references
References 42 publications
0
21
0
Order By: Relevance
“…It is straightforward, so we omit the detail. Given a data set, the spikes of the population correlation R have to be determined; see Fan et al (2020) and Morales-Jimenez et al (2021).…”
Section: One Sample Mean Test For Large P and Small Nmentioning
confidence: 99%
“…It is straightforward, so we omit the detail. Given a data set, the spikes of the population correlation R have to be determined; see Fan et al (2020) and Morales-Jimenez et al (2021).…”
Section: One Sample Mean Test For Large P and Small Nmentioning
confidence: 99%
“…Whether such an approach is statistically beneficial is task-dependent and deserves a more thorough consideration in future work. On the one hand, too many nodes may induce many errors and a systematic distortion in the spectrum of empirical covariance matrices (Donoho et al, 2013, Morales-Jimenez et al, 2021, on the other hand single errors have higher impact in smaller networks, and node estimation constitutes yet another challenging task in the network construction pipeline. A key question in this regard is: How can we minimize the edge-wise estimation variance in the downsized network?…”
Section: Discussionmentioning
confidence: 99%
“…[20]. We however notice that papers addressing the behaviour of the corresponding sample correlation matrices are quite scarce, see [21] when the low rank perturbation affects only the first components of the observations.…”
Section: A Low Vs High-dimensional Regimementioning
confidence: 95%