2015
DOI: 10.1038/ng.3190
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Bayesian mixed-model analysis increases association power in large cohorts

Abstract: Linear mixed models are a powerful statistical tool for identifying genetic associations and avoiding confounding. However, existing methods are computationally intractable in large cohorts, and may not optimize power. All existing methods require time cost O(MN2) (where N = #samples and M = #SNPs) and implicitly assume an infinitesimal genetic architecture in which effect sizes are normally distributed, which can limit power. Here, we present a far more efficient mixed model association method, BOLT-LMM, whic… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

12
1,584
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 1,451 publications
(1,596 citation statements)
references
References 49 publications
(141 reference statements)
12
1,584
0
Order By: Relevance
“…Discovery GWAS analysis was performed on UK Biobank on V.3 imputed genotypes using BOLT-LMM V.2.34, which applies a linear mixed model to adjust for the effects of population structure and individual relatedness 16. This enabled the inclusion of all related individuals in our white European subset allowing a sample size of 451 099 individuals as detailed in online supplementary materials and methods.…”
Section: Methodsmentioning
confidence: 99%
“…Discovery GWAS analysis was performed on UK Biobank on V.3 imputed genotypes using BOLT-LMM V.2.34, which applies a linear mixed model to adjust for the effects of population structure and individual relatedness 16. This enabled the inclusion of all related individuals in our white European subset allowing a sample size of 451 099 individuals as detailed in online supplementary materials and methods.…”
Section: Methodsmentioning
confidence: 99%
“…26 This approach corrects for all levels of inter-individual correlation of genotypes due to relatedness, from close relatives to cryptic relatedness caused by population stratification. We inverse normalised the socioeconomic status measures, then took the residuals using three covariates (age, sex, assessment centre location) and inverse normalised again.…”
Section: Sensitivity Analysesmentioning
confidence: 99%
“…Other methods, including [17, 19, 18], can always raise the corresponding Kinship matrix to achieve a similar performance. In other words, the novelty of our paper can also be understood as introducing a general framework with the argument that raising the power of kinship matrix can effectively increase the power of confounding correction.…”
Section: Resultsmentioning
confidence: 99%
“…To address this challenge, Principle Component Analysis that extracts the major axes of population differentiation from genotype data has been explored [11]. Linear mixed model is another tool that provides a more dedicated control of modeling population structure with its random effect component and it has been shown to greatly reduce the impact of population structure [12, 13, 14, 15, 16, 17, 18, 19, 20]. …”
Section: Introductionmentioning
confidence: 99%