2020
DOI: 10.1371/journal.pcbi.1007819
|View full text |Cite
|
Sign up to set email alerts
|

DOT: Gene-set analysis by combining decorrelated association statistics

Abstract: Historically, the majority of statistical association methods have been designed assuming availability of SNP-level information. However, modern genetic and sequencing data present new challenges to access and sharing of genotype-phenotype datasets, including cost management, difficulties in consolidation of records across research groups, etc. These issues make methods based on SNP-level summary statistics particularly appealing. The most common form of combining statistics is a sum of SNP-level squared score… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 13 publications
(8 citation statements)
references
References 81 publications
0
8
0
Order By: Relevance
“…which increases with the increased heterogeneity of µ. More details on the performance of the decorrelation approach are given by us elsewhere, 19 but here we briefly note that this finding is practically relevant because substantial heterogeneity of associations is expected among genetic variants, leading to a substantial power boost, as we next illustrate via re-analysis of published associations of genetic variants within the µ-opioid gene with pain sensitivity.…”
Section: Simulation Study Resultsmentioning
confidence: 76%
See 1 more Smart Citation
“…which increases with the increased heterogeneity of µ. More details on the performance of the decorrelation approach are given by us elsewhere, 19 but here we briefly note that this finding is practically relevant because substantial heterogeneity of associations is expected among genetic variants, leading to a substantial power boost, as we next illustrate via re-analysis of published associations of genetic variants within the µ-opioid gene with pain sensitivity.…”
Section: Simulation Study Resultsmentioning
confidence: 76%
“…Theoretical properties and extensive numerical evaluation of DOT will be published elsewhere and currently these findings are available as a preprint. 19 Further, we illustrate an application of our methods with analyses of variants within the µ-opioid gene that have been shown to affect sensitivity to pain. We find strengthened evidence of overall association within the 11-SNP block.…”
Section: Discussionmentioning
confidence: 99%
“…[49][50][51] Summary statistics can also be decorrelated before being summed together, which is powerful under heterogeneity of effect sizes and variation between pairwise LD patterns. 49 The stage 1 test implemented here has the same functional form as that used in VEGAS 30 and fastBAT. 31 This set-based test can be more powerful than a test which takes the maximum of the chi-squared test statistics in a region, an approach implemented in GATES 52 and Pascal-Max, 53 especially when there are multiple independent association signals.…”
Section: Discussionmentioning
confidence: 99%
“…These include versions with weighted sums of summary statistics, known as gene set analysis (GSA) tests or burden tests for rare variants. [49][50][51] Summary statistics can also be decorrelated before being summed together, which is powerful under heterogeneity of effect sizes and variation between pairwise LD patterns. 49 The stage 1 test implemented here has the same functional form as that used in VEGAS 30 and fastBAT.…”
Section: Discussionmentioning
confidence: 99%
“…Similar to other simulation studies [33], we utilized realistic linkage disequilibrium patterns by using data from the 1000 Genomes Project [34] for a 100 Kb region of chromosome 17 which included 12,735 SNPs spanning from the FGF11 gene to the NDEL1 gene. The selection of this region of the genome was arbitrary, but we expect the linkage disequilibrium structure of this region to be representative of, and generalizable to, other regions of the genome.…”
Section: Data Simulationmentioning
confidence: 99%