2023
DOI: 10.1073/pnas.2311420120
|View full text |Cite
|
Sign up to set email alerts
|

Phantom oscillations in principal component analysis

Maxwell Shinn

Abstract: Principal component analysis (PCA) is a dimensionality reduction method that is known for being simple and easy to interpret. Principal components are often interpreted as low-dimensional patterns in high-dimensional space. However, this simple interpretation fails for timeseries, spatial maps, and other continuous data. In these cases, nonoscillatory data may have oscillatory principal components. Here, we show that two common properties of data cause oscillatory principal components: smoothness and shifts in… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
14
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 27 publications
(16 citation statements)
references
References 53 publications
2
14
0
Order By: Relevance
“…Because the SVD decomposed the gene expression into linear combinations of orthogonal basis vectors, we were able to assess the contribution of individual genes, DEGs, or entire Hallmark gene sets to CML initiation and growth. Although the SVD can produce artifacts when applied to time-series data, we observed no oscillations or artifacts, and moreover, we observed a strong correlation between state-space trajectories and BCR::ABL expression as an immunophenotypic marker of disease, supporting our interpretation of the principal component as a transcriptome state-space 31 . We observed that the Es ( c 1 ) state was largely characterized by anti-CML DEGs that resisted CML development whereas the Ts ( c 3 ) and Ls ( c 5 ) disease states were characterized by pro-CML DEGs associated with the movement of state-transition toward late disease state.…”
Section: Discussionsupporting
confidence: 80%
See 1 more Smart Citation
“…Because the SVD decomposed the gene expression into linear combinations of orthogonal basis vectors, we were able to assess the contribution of individual genes, DEGs, or entire Hallmark gene sets to CML initiation and growth. Although the SVD can produce artifacts when applied to time-series data, we observed no oscillations or artifacts, and moreover, we observed a strong correlation between state-space trajectories and BCR::ABL expression as an immunophenotypic marker of disease, supporting our interpretation of the principal component as a transcriptome state-space 31 . We observed that the Es ( c 1 ) state was largely characterized by anti-CML DEGs that resisted CML development whereas the Ts ( c 3 ) and Ls ( c 5 ) disease states were characterized by pro-CML DEGs associated with the movement of state-transition toward late disease state.…”
Section: Discussionsupporting
confidence: 80%
“…Although the correlation with BCR::ABL was by far the best for the CML state-space compared to all other PCs, the correlation was low (R 2 = 0.48; Table S1). Although PCA can be misinterpreted when applied to time-series data with temporal correlations, we are confident in our application and interpretation here because of the correlation with BCR::ABL expression, phenotypic disease manifestation, and the absence of oscillations or other artifacts in the PCs 31,40 . This is due to the orthogonality of the control (tet on) and CML (tet off) mice transcriptome endpoints and dynamics, which enable an orthogonal decomposition of the data and identification of the leukemia signal from the background/noise.…”
Section: Methodsmentioning
confidence: 79%
“…Following head entry, both trajectories deviate from zero, suggesting increased neural activity. Interestingly, they exhibit an initial “twist” (potentially a phantom oscillation because of data smoothing 36 ) between the Go signal and the rat’s exit from the central port. In the Choice epoch, the trajectories undergo a second “twist” and move further away from the origin, reflecting increased engagement.…”
Section: Electrophysiologymentioning
confidence: 99%
“…Sometimes a time-shifting pattern in data (i.e. different neurons with different spike latencies) can result in oscillatory jPCs, even if oscillatory dynamics are not actually in the data 29 . In these cases, all the jPCs show rotations.…”
Section: A Disruptive Saccade Caused Rotations In Subspacementioning
confidence: 99%