2016
DOI: 10.1073/pnas.1617317113
|View full text |Cite
|
Sign up to set email alerts
|

Simultaneous dimension reduction and adjustment for confounding variation

Abstract: Dimension reduction methods are commonly applied to highthroughput biological datasets. However, the results can be hindered by confounding factors, either biological or technical in origin. In this study, we extend principal component analysis (PCA) to propose AC-PCA for simultaneous dimension reduction and adjustment for confounding (AC) variation. We show that AC-PCA can adjust for (i) variations across individual donors present in a human brain exon array dataset and (ii) variations of different species in… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
58
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
6
2
2

Relationship

0
10

Authors

Journals

citations
Cited by 47 publications
(58 citation statements)
references
References 26 publications
0
58
0
Order By: Relevance
“…PCA is widely used for data visualization [39][40][41], data QC [42], feature selection [13,[43][44][45][46][47][48][49], de-noising [50,51], imputation [52][53][54], confirmation and removal of batch effects [55][56][57], confirmation and estimation of cell-cycle effects [58], rare cell type detection [59,60], cell type and cell state similarity search [61], pseudotime inference [13,[62][63][64][65][66], and spatial reconstruction [9].…”
Section: Review Of Pca Algorithms and Implementationsmentioning
confidence: 99%
“…PCA is widely used for data visualization [39][40][41], data QC [42], feature selection [13,[43][44][45][46][47][48][49], de-noising [50,51], imputation [52][53][54], confirmation and removal of batch effects [55][56][57], confirmation and estimation of cell-cycle effects [58], rare cell type detection [59,60], cell type and cell state similarity search [61], pseudotime inference [13,[62][63][64][65][66], and spatial reconstruction [9].…”
Section: Review Of Pca Algorithms and Implementationsmentioning
confidence: 99%
“…Finally, as MFMR seeks clusters that are unaffected by confounders like population 271 structure, age or sex, it may be useful for clustering in settings where protecting certain 272 information is important for privacy or fairness [69]. In this sense, MFMR is to GMM 273 roughly as AC-PCA [70] or contrastive PCA [71] are to ordinary PCA. We derive a novel clustering algorithm, multitrait finite mixture of regressions (MFMR), 286 beginning from the standard regression model for interaction.…”
mentioning
confidence: 99%
“…PCA is widely used for data visualization [39][40][41], data QC [42], feature selection [13,[43][44][45][46][47][48][49], de-noising [50,51], imputation [52][53][54], confirmation and removal of batch effects [55][56][57], confirmation and estimation of cell-cycle effects [58], rare cell type detection [59,60], cell type and cell state similarity search [61], pseudotime inference [13,[62][63][64][65][66], and spatial reconstruction [9].…”
Section: Review Of Pca Algorithms and Implementationsmentioning
confidence: 99%