2016
DOI: 10.1038/srep36671
|View full text |Cite
|
Sign up to set email alerts
|

Combining Multiple Hypothesis Testing with Machine Learning Increases the Statistical Power of Genome-wide Association Studies

Abstract: The standard approach to the analysis of genome-wide association studies (GWAS) is based on testing each position in the genome individually for statistical significance of its association with the phenotype under investigation. To improve the analysis of GWAS, we propose a combination of machine learning and statistical testing that takes correlation structures within the set of SNPs under investigation in a mathematically well-controlled manner into account. The novel two-step algorithm, COMBI, first trains … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
105
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 63 publications
(106 citation statements)
references
References 71 publications
1
105
0
Order By: Relevance
“…To date, no PRS for ND that combines both common and rare variants has been calculated, which may lead to an increase of their accuracy. Novel machine learning methods are being developed to capture compact representations of GWAS [79] and coupled with powerful classification methods, namely deep neural networks, to produce highly accurate predictions [80][81][82]. The parallel development of larger cohorts, molecular phenotyping and the advancement of novel and more sophisticated methods to represent and operate multi-variant models, will allow a more precise discernment of the overlap of the genetic architecture among ND and predict individuals at risk with the accuracy required in clinical settings.…”
Section: Discussionmentioning
confidence: 99%
“…To date, no PRS for ND that combines both common and rare variants has been calculated, which may lead to an increase of their accuracy. Novel machine learning methods are being developed to capture compact representations of GWAS [79] and coupled with powerful classification methods, namely deep neural networks, to produce highly accurate predictions [80][81][82]. The parallel development of larger cohorts, molecular phenotyping and the advancement of novel and more sophisticated methods to represent and operate multi-variant models, will allow a more precise discernment of the overlap of the genetic architecture among ND and predict individuals at risk with the accuracy required in clinical settings.…”
Section: Discussionmentioning
confidence: 99%
“…When tailored for understanding GWAS data, ML predictions can provide an improved statistical foundation of evidence to support or improve GWAS results. For instance, ML in GWAS has been applied to identify loci, increase the statistical power of GWAS (Mieth et al, 2016), detect epistatic interactions (Leem et al, 2014), improve polygenic risk scoring produced from GWAS (Pare et al, 2017), and prioritize genes and variants on post-GWAS analysis (Vitsios and Petrovski, 2019). Here we will focus on the ML applications developed for post-GWAS prioritization.…”
Section: Introductionmentioning
confidence: 99%
“…Taking advantage of it will depend on state-of-the-art mathematical and statistical approaches capable of incorporating large numbers of single-nucleotide polymorphisms into risk models, artificial intelligence, and supervised machine learning approaches. 98,99…”
Section: Genetic Predisposition To Lung Cancermentioning
confidence: 99%