2019
DOI: 10.1101/810713
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Fine-tuning Polygenic Risk Scores with GWAS Summary Statistics

Abstract: Polygenic risk scores (PRSs) have wide applications in human genetics research. Notably, most PRS models include tuning parameters which improve predictive performance when properly selected. However, existing model-tuning methods require validation data that is independent with both training and testing samples. These data rarely exist in practice, creating a significant gap between PRS methodology and applications. Here, we introduce PUMAS, a novel method to finetune PRS models using summary statistics from … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
4
1

Relationship

5
4

Authors

Journals

citations
Cited by 15 publications
(22 citation statements)
references
References 43 publications
0
22
0
Order By: Relevance
“…Given X T Y/n, the estimate of γ from all n training samples, our aim is to generate X A T Y A /n A an estimate of γ from n A samples (where n A =n-n E ). Our approach is similar to that of Zhao et al, 43 who propose sampling X A T Y A /n A from N(X T Y/n, n E /n A V/n), where V is the variance of X T Y. However, while Zhao et al restrict to independent SNPs, and subsequently derive V = I + X T YY T X/n 2 , where I is an identity matrix, we instead use V = X T X, as proposed by Zhu and Stephens.…”
Section: Methodsmentioning
confidence: 94%
See 1 more Smart Citation
“…Given X T Y/n, the estimate of γ from all n training samples, our aim is to generate X A T Y A /n A an estimate of γ from n A samples (where n A =n-n E ). Our approach is similar to that of Zhao et al, 43 who propose sampling X A T Y A /n A from N(X T Y/n, n E /n A V/n), where V is the variance of X T Y. However, while Zhao et al restrict to independent SNPs, and subsequently derive V = I + X T YY T X/n 2 , where I is an identity matrix, we instead use V = X T X, as proposed by Zhu and Stephens.…”
Section: Methodsmentioning
confidence: 94%
“…Therefore, we instead create "pseudo" partial summary statistics. 43 Let γ = (γ 1 , γ 2 , …, γ m ) T denote the vector of true SNP effect sizes from single-SNP analysis (note that γ j will usually differ from β j , because β j reflects how much SNP j contributes directly to the phenotype, whereas γ j reflects how much contribution it tags). Given X T Y/n, the estimate of γ from all n training samples, our aim is to generate X A T Y A /n A an estimate of γ from n A samples (where n A =n-n E ).…”
Section: Methodsmentioning
confidence: 99%
“…The phenotypes for the association analysis were chosen based on the feasibility of the CSF metabolome being relevant to the phenotype and the availability of GWAS summary statistics for the phenotype. The only exception was the GWAS for the AD proxy phenotype, which was developed in-house on the UK Biobank data set as a surrogate measure for AD risk based on parental diagnosis and age at diagnosis, following previous research 100,101 . The CNS phenotypes and sources of the GWAS summary statistics are listed in Supplementary Table 9.…”
Section: Metabolite-phenotype Association Testingmentioning
confidence: 99%
“…We calculated PRS of ASD and other 67 complex traits ( Table S1 ) for all parents and simulated offspring. Each PRS was statistically fine-tuned from GWAS summary statistics using PUMAS [10] ( Methods ) and standardized by cohort using the mean and standard deviation of parental scores.…”
Section: Resultsmentioning
confidence: 99%