2020
DOI: 10.1093/bioinformatics/btaa229
|View full text |Cite
|
Sign up to set email alerts
|

Prioritizing genetic variants in GWAS with lasso using permutation-assisted tuning

Abstract: Motivation Large scale genome-wide association studies (GWAS) have resulted in the identification of a wide range of genetic variants related to a host of complex traits and disorders. Despite their success, the individual single-nucleotide polymorphism (SNP) analysis approach adopted in most current GWAS can be limited in that it is usually biologically simple to elucidate a comprehensive genetic architecture of phenotypes and statistically underpowered due to heavy multiple-testing correcti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
14
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
8
1
1

Relationship

1
9

Authors

Journals

citations
Cited by 17 publications
(14 citation statements)
references
References 38 publications
0
14
0
Order By: Relevance
“…Thus, the fitted regression model does not include all the predictors in the original dataset but only those variables with the strongest effect on the molecular phenotype, circumventing the necessity for explicit multiple testing correction; this leads more interpretable prediction models and also prevents overfitting of the model. The use of LASSO is increasingly common in the field of genetic association studies [ 2–5 ].…”
Section: Introductionmentioning
confidence: 99%
“…Thus, the fitted regression model does not include all the predictors in the original dataset but only those variables with the strongest effect on the molecular phenotype, circumventing the necessity for explicit multiple testing correction; this leads more interpretable prediction models and also prevents overfitting of the model. The use of LASSO is increasingly common in the field of genetic association studies [ 2–5 ].…”
Section: Introductionmentioning
confidence: 99%
“…Such choices, however, may not be appropriate for the relatively small sample sizes frequently encountered in most current microbiome association analyses. Among some popular feature selection tools ( 19 , 26 , 27 , 46 ) evaluated in our numerical studies, the feature selection strategy outlined in Algorithm 1 had the optimum performance within the context of the current paper (see details in Section 1 of the online Supplementary Data ). It is of future research interest to further boost the power of AMAT by sharping the intermediate feature selection tool embedded in it.…”
Section: Discussionmentioning
confidence: 94%
“…For example, they ignore feature dependence and nonlinear structure, lack flexibility, and require a large sample size. When implementing these methods, people tend to assume a parametric form (like linear model and logistic regression model) to describe the relationship between the response and the features and ignore the interactions among features, which, however, can be complex and nonlinear in practice (Wu et al, 2009;Yang et al, 2020). Despite that both supervised and unsupervised feature selections have important applications, most algorithms cannot combine supervised and unsupervised learnings effectively (see Ding, 2003;Varshavsky et al, 2006;Qi et al, 2018;Zhu et al, 2018;Taherkhani et al, 2018;Solorio-Fernández et al, 2020).…”
Section: Introductionmentioning
confidence: 99%