2023
DOI: 10.1101/2023.01.24.23284898
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Efficacy of federated learning on genomic data: a study on the UK Biobank and the 1000 Genomes Project

Abstract: Combining training data from multiple sources increases sample size and reduces confounding, leading to more accurate and less biased machine learning models. In healthcare, however, direct pooling of data is often not allowed by data custodians who are accountable for minimizing the exposure of sensitive information. Federated learning offers a promising solution to this problem by training a model in a decentralized manner thus reducing the risks of data leakage. Although there is increasing utilization of f… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 63 publications
0
1
0
Order By: Relevance
“…Linkage disequilibrium (LD) was also assessed using the threshold for LD pruning (r 2 > 0.8) (excluding 4 SNPs, n = 68 SNPs). This study did not include any ambiguous, palindromic SNPs or SNPs with an imputation score < 0.8 [36]. Finally, only the SNPs that passed the SNP QC from the Caucasian GWAS (n = 40 SNPs) were included in this study.…”
Section: Derivation Of the Polygenic Risk Score (Prs)mentioning
confidence: 99%
“…Linkage disequilibrium (LD) was also assessed using the threshold for LD pruning (r 2 > 0.8) (excluding 4 SNPs, n = 68 SNPs). This study did not include any ambiguous, palindromic SNPs or SNPs with an imputation score < 0.8 [36]. Finally, only the SNPs that passed the SNP QC from the Caucasian GWAS (n = 40 SNPs) were included in this study.…”
Section: Derivation Of the Polygenic Risk Score (Prs)mentioning
confidence: 99%