2016
DOI: 10.1038/srep37444
|View full text |Cite
|
Sign up to set email alerts
|

A Nonparametric Regression Approach to Control for Population Stratification in Rare Variant Association Studies

Abstract: Recently, there is increasing interest to detect associations between rare variants and complex traits. Rare variant association studies usually need large sample sizes due to the rarity of the variants, and large sample sizes typically require combining information from different geographic locations within and across countries. Although several statistical methods have been developed to control for population stratification in common variant association studies, these methods are not necessarily controlling … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
12
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(12 citation statements)
references
References 39 publications
0
12
0
Order By: Relevance
“…However, logistic regression may not be the most powerful analysis strategy for rare variants and so higher inflation might be observed with statistical approaches designed for rare variants. Since our model simulations included only a single candidate SNP, we could not evaluate any rare variant methods, such as SKAT (Wu et al, 2011), which can be modified to correct for ancestry (Luo et al, 2018), or evaluate any ancestry correction methods designed for rare variants, such as Sha et al (2016). Given the slight false positive rate inflation that we saw even after including PCs in the logistic regression model (Balding-Nichols simulation), a more in-depth exploration of PC-based corrections under EPS when there are rare variants is needed.…”
Section: Discussionmentioning
confidence: 99%
“…However, logistic regression may not be the most powerful analysis strategy for rare variants and so higher inflation might be observed with statistical approaches designed for rare variants. Since our model simulations included only a single candidate SNP, we could not evaluate any rare variant methods, such as SKAT (Wu et al, 2011), which can be modified to correct for ancestry (Luo et al, 2018), or evaluate any ancestry correction methods designed for rare variants, such as Sha et al (2016). Given the slight false positive rate inflation that we saw even after including PCs in the logistic regression model (Balding-Nichols simulation), a more in-depth exploration of PC-based corrections under EPS when there are rare variants is needed.…”
Section: Discussionmentioning
confidence: 99%
“…Sha et al. () show the effectiveness of PC‐nonp in controlling for population substructure with burden RV tests, and it is of great interest to evaluate the performance of PC‐nonp with SKAT. However, in our preliminary explorations, it seems that SKAT‐PC‐nonp was not able to control for confounding regardless which type of variants are used to obtain PCs (data not shown).…”
Section: Discussionmentioning
confidence: 99%
“…Unlike the typical PC-based correction methods that assume the top PCs have a linear effect on the traits, PC-nonp uses a nonparametric regression to model the potential nonlinear or complex effects of the PCs. Sha et al (2016) show the effectiveness of PC-nonp in controlling for population substructure with burden RV tests, and it is of great interest to evaluate the performance of PC-nonp with (from top to bottom) no confounders, confounders with a discrete spatial distribution, and confounders with a continuous spatial distribution. Different columns indicate the types of variants used to construct PC/VC, i.e., RV for rare variants, LFV for less frequent variants, CV for common variants, and AV for all variants SKAT.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…All these burden tests assume all the variants share the same effect direction and magnitude (after incorporating weights). Thus, any violation of this assumption can result in a loss of power 8 , 10 , 35 . To overcome the limitations of the burden tests, the data-adaptive sum test (aSum) was proposed 36 .…”
Section: Methodsmentioning
confidence: 99%