2021
DOI: 10.1101/2021.03.08.434451
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Identification of putative causal loci in whole-genome sequencing data via knockoff statistics

Abstract: The analysis of whole-genome sequencing studies is challenging due to the large number of rare variants in noncoding regions and the lack of natural units for testing. We propose a statistical method to detect and localize rare and common risk variants in whole-genome sequencing studies based on a recently developed knockoff framework. It can (1) prioritize causal variants over associations due to linkage disequilibrium thereby improving interpretability; (2) help distinguish the signal due to rare variants fr… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
38
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
2

Relationship

4
1

Authors

Journals

citations
Cited by 7 publications
(39 citation statements)
references
References 65 publications
1
38
0
Order By: Relevance
“…The framework allows for the incorporation of a variety of functional genomics annotations as weights for individual variants included in the tests. Furthermore, a novel aspect of our testing framework is the derivation of knockoff statistics based on the generation of knockoff (synthetic) genetic data that resemble the original genotypes in terms of correlation structure but are conditionally independent of the outcome variable given the true genotypes 8,28,32 . The knockoff genotypes are essentially noisy copies of the original genotypes and serve as negative controls for the original genotype data; they help select important genes while controlling the FDR.…”
Section: Overview Of the Proposed Gene-based Association Testsmentioning
confidence: 99%
See 4 more Smart Citations
“…The framework allows for the incorporation of a variety of functional genomics annotations as weights for individual variants included in the tests. Furthermore, a novel aspect of our testing framework is the derivation of knockoff statistics based on the generation of knockoff (synthetic) genetic data that resemble the original genotypes in terms of correlation structure but are conditionally independent of the outcome variable given the true genotypes 8,28,32 . The knockoff genotypes are essentially noisy copies of the original genotypes and serve as negative controls for the original genotype data; they help select important genes while controlling the FDR.…”
Section: Overview Of the Proposed Gene-based Association Testsmentioning
confidence: 99%
“…Note that the well-known permutation procedure that permutes the samples does not guarantee these exchangeability properties between the original and knockoff genotypes. To generate valid knockoff genotypes we can use a sequential model for knockoff generation that leverages the local patterns of linkage disequilibrium, as previously proposed based on the Hidden Markov Models (HMMs) 32,84 or an auto-regressive model 8 , in such a way that the knockoff genotypes are exchangeable with the original (true) genotypes G but are independent of the phenotype conditional on the original genotypes. The knockoff genotypes serve as negative controls and are designed to mimic the correlation or LD structure found within the original genotypes.…”
Section: Genescan3dknock: Knockoff-enhanced Gene-based Test For Causal Gene Discoverymentioning
confidence: 99%
See 3 more Smart Citations