2018
DOI: 10.1016/j.ymeth.2018.04.021
|View full text |Cite
|
Sign up to set email alerts
|

Variable selection in heterogeneous datasets: A truncated-rank sparse linear mixed model with applications to genome-wide association studies

Abstract: A fundamental and important challenge in modern datasets of ever increasing dimensionality is variable selection, which has taken on renewed interest recently due to the growth of biological and medical datasets with complex, non-i.i.d. structures. Naïvely applying classical variable selection methods such as the Lasso to such datasets may lead to a large number of false discoveries. Motivated by genome-wide association studies in genetics, we study the problem of variable selection for datasets arising from m… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 44 publications
0
1
0
Order By: Relevance
“…The study of population structure allows assigning individuals to distinct ethnic groups cohabiting a particular region (Liu et al 2018), investigating migrations from the origin of admixed populations (Haber et al 2016), and quantifying and characterising confounding due to shared genetic ancestry in association studies (Wang et al 2018). In humans, genetic variation is not randomly distributed across the world because of non-random mating between individuals, who tend to marry within their community, often driven by physical proximity (Schneider and Peischl 2011).…”
Section: Introductionmentioning
confidence: 99%
“…The study of population structure allows assigning individuals to distinct ethnic groups cohabiting a particular region (Liu et al 2018), investigating migrations from the origin of admixed populations (Haber et al 2016), and quantifying and characterising confounding due to shared genetic ancestry in association studies (Wang et al 2018). In humans, genetic variation is not randomly distributed across the world because of non-random mating between individuals, who tend to marry within their community, often driven by physical proximity (Schneider and Peischl 2011).…”
Section: Introductionmentioning
confidence: 99%