2022
DOI: 10.21203/rs.3.rs-1298372/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

An empirical comparison between polygenic risk scores and machine learning for case/control classification

Abstract: BackgroundWe compared the procedure to calculate polygenic risk scores and machine learning for simulated data, devised a way to compare machine learning results with PRS, and highlighted the required files formats for PRS calculation and machine learning model training. For PRS calculation, we used three tools: Plink, PRSice, and Lassosum, and for the machine learning algorithm, we used artificial neural networks. ResultsBased on our survey, we cannot say machine learning is better or polygenic risk scores be… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 14 publications
0
4
0
Order By: Relevance
“…However, the linear polygenic models do not have the sufficient expressive capacity to learn and transfer complex representations across subpopulations with different genetic architectures. Recent studies indicate that the deep learning models capable of capturing complex nonlinear interactions generally outperform the linear disease prediction models (83)(84)(85).…”
Section: P(yx) = P(y|x) • P(x)mentioning
confidence: 99%
“…However, the linear polygenic models do not have the sufficient expressive capacity to learn and transfer complex representations across subpopulations with different genetic architectures. Recent studies indicate that the deep learning models capable of capturing complex nonlinear interactions generally outperform the linear disease prediction models (83)(84)(85).…”
Section: P(yx) = P(y|x) • P(x)mentioning
confidence: 99%
“…These algorithms have already been used for genotype-phenotype [48][49][50] prediction making them applicate for genotype-phenotype transfer learning.…”
Section: Recurrent Neural Networkmentioning
confidence: 99%
“…Such tools usually adapt complicated machine learning models that may consider nonlinear analysis as well as causality assumptions or inference (Muneeb et al, 2022;Meijering and Gianola, 1985;Sailer and Harms, 2017;Bao et al, 2020;Basu et al, 2018;Lee et al, 2016).…”
Section: Introductionmentioning
confidence: 99%
“…These data exposed the challenges of handling correlations between in-between-ome terms (e.g., co-expressions in the transcriptome), however they also provide opportunities (Wainberg et al ., 2019) These data have triggered development of sophisticated tools leveraging in-between-omes to characterize the genetic basis of complex traits. Such tools usually adapt complicated machine learning models that may consider nonlinear analysis as well as causality assumptions or inference (Muneeb et al ., 2022; Meijering and Gianola, 1985; Sailer and Harms, 2017; Bao et al ., 2020; Basu et al ., 2018; Lee et al ., 2016). However, there are no standard simulators to benchmark the performance of the newly developed tools, leaving authors to develop different ad hoc simulations tailoring to their works.…”
Section: Introductionmentioning
confidence: 99%