2022
DOI: 10.1101/2022.11.10.22282137
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Correcting for volunteer bias in GWAS uncovers novel genetic variants and increases heritability estimates

Abstract: The implications of selection bias due to volunteering (volunteer bias) for genetic association studies are poorly understood. Because of its large sample size and extensive phenotyping, the UK Biobank (UKB) is included in almost all large genomewide association studies (GWAS) to date, as it is one of the largest cohorts. Yet, it is known to be highly selected. We develop inverse probability weighted GWAS (WGWAS) to estimate GWAS summary statistics in the UKB that are corrected for volunteer bias. WGWAS decrea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 39 publications
0
6
0
Order By: Relevance
“…In contrast, the MR is based on GWASs of international samples who are volunteers of European-ancestry. Both higher risk for mental illness and lower EA are known to increase non-participation and participant attrition [56][57][58][59] , a selection bias that could induce a collider bias 60 . In the withinsibship analysis, we rely on diagnoses by professionals in specialized care, therefore dependent on the current referral policies of the Dutch healthcare system.…”
Section: Limitationsmentioning
confidence: 99%
“…In contrast, the MR is based on GWASs of international samples who are volunteers of European-ancestry. Both higher risk for mental illness and lower EA are known to increase non-participation and participant attrition [56][57][58][59] , a selection bias that could induce a collider bias 60 . In the withinsibship analysis, we rely on diagnoses by professionals in specialized care, therefore dependent on the current referral policies of the Dutch healthcare system.…”
Section: Limitationsmentioning
confidence: 99%
“…ensure statistical significance while having limited or quantifiable confounders (e.g. batch effects or socioeconomic factors [79]), a requirement of any evaluatory dataset [25].…”
Section: Introductionmentioning
confidence: 99%
“…Our goal is to provide a set of benchmarking tasks for genomic AI that A) allow for direct, supervised training of high-complexity models from scratch (tabula rasa) for comparison to pretraining regimes for transfer learning, and B) ensure statistical significance while having limited or quantifiable confounders (e.g. batch effects or socioeconomic factors [79]), a requirement of any evaluatory dataset [25]. GUANinE v0.9 prioritizes functional genomic annotation and understanding on short-to-moderate length sequences (between 80 and 512 nucleotides), rather than exploring long sequences inputs or distal effects [39,21,37].…”
Section: Introductionmentioning
confidence: 99%
“…In contrast, the MR is based on GWASs of international samples who are volunteers of European-ancestry. Both higher risk for mental illness and lower EA are known to increase non-participation and participant attrition [261][262][263][264] , a selection bias that could induce a collider bias 265 . In the within-sibship analysis, we rely on diagnoses by professionals in specialized care, therefore dependent on the current referral policies of the Dutch healthcare system.…”
Section: Limitationsmentioning
confidence: 99%
“…Correction for the non-representativity of these cohorts can be partially done, for example weights for correcting the important volunteer bias of UK Biobank have recently been made available recently 263,264,377 . Similar efforts should be undertaken for all large cohorts on which most of the current genetically-informed studies are taking place.…”
Section: Internal Validity Is Not Enough: a Word Of Caution On Extern...mentioning
confidence: 99%