2022
DOI: 10.1186/s13059-022-02761-4
|View full text |Cite
|
Sign up to set email alerts
|

PCA outperforms popular hidden variable inference methods for molecular QTL mapping

Abstract: Background Estimating and accounting for hidden variables is widely practiced as an important step in molecular quantitative trait locus (molecular QTL, henceforth “QTL”) analysis for improving the power of QTL identification. However, few benchmark studies have been performed to evaluate the efficacy of the various methods developed for this purpose. Results Here we benchmark popular hidden variable inference methods including surrogate variable a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 35 publications
(30 citation statements)
references
References 54 publications
0
22
0
Order By: Relevance
“…To balance the discovery power and potential false positives, there are different methods to determine the optimal number of factors. Two commonly used methods are the automatic elbow detection method and Buja and Eyuboglu (BE) algorithm, which has been comprehensively evaluated for PCA in bulk RNA data [ 16 ]. We ran these two methods for PCs inferred from the single-cell data for each QC option (see the “ Methods ” section).…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…To balance the discovery power and potential false positives, there are different methods to determine the optimal number of factors. Two commonly used methods are the automatic elbow detection method and Buja and Eyuboglu (BE) algorithm, which has been comprehensively evaluated for PCA in bulk RNA data [ 16 ]. We ran these two methods for PCs inferred from the single-cell data for each QC option (see the “ Methods ” section).…”
Section: Resultsmentioning
confidence: 99%
“…These findings suggest that either PCs or PFs for single-cell eQTL mapping can be used to improve the number of eGene discoveries. However, the computational burden and flexibility are different between these two methods [ 13 , 16 ].…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…We used the genotypes' first three principal components (PCs) as covariates, and the gene expression's PCs to account for technical and other co-variabilities in the RNA-Seq data, as suggested before 36,37 . We optimised the number of PCs used as covariates by maximising the number of genes with significant (FDR <5%) cis-QTLs, i.e., the first 70 PCs for exQTLs, the first 50 PCs for inQTLs, and the first 40 PCs for ex-inQTLs.…”
Section: Mapping Of Cis-qtlsmentioning
confidence: 99%