2013
DOI: 10.1101/000752
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Joint analysis of functional genomic data and genome-wide association studies of 18 human traits

Abstract: Annotations of gene structures and regulatory elements can inform genome-wide association studies (GWASs). However, choosing the relevant annotations for interpreting an association study of a given trait remains challenging. I describe a statistical model that uses association statistics computed across the genome to identify classes of genomic elements that are enriched with or depleted of loci influencing a trait. The model naturally incorporates multiple types of annotations. I applied the model to GWASs o… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(6 citation statements)
references
References 57 publications
0
6
0
Order By: Relevance
“…We modeled the probability that a SNP is associated given its functional data using a logistic regression model in which the association status of the individual SNPs in the case blocks is assumed unknown. The model assumes, however, that each case block contains at least one truly associated SNP and that each control block contains none; as in [ 7 , 14 ], the identities of the associated SNPs are not specified as they are unknown (details of the model are provided in Section “Model”); a prior distribution on the overall fraction of associated SNPs was employed to encourage their number to be close to one on average. In this way, the model is free to identify multivariate patterns in the annotation data (‘signatures’) of individual SNPs that would be diluted by averaging the annotation measurements of all SNPs within a block.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…We modeled the probability that a SNP is associated given its functional data using a logistic regression model in which the association status of the individual SNPs in the case blocks is assumed unknown. The model assumes, however, that each case block contains at least one truly associated SNP and that each control block contains none; as in [ 7 , 14 ], the identities of the associated SNPs are not specified as they are unknown (details of the model are provided in Section “Model”); a prior distribution on the overall fraction of associated SNPs was employed to encourage their number to be close to one on average. In this way, the model is free to identify multivariate patterns in the annotation data (‘signatures’) of individual SNPs that would be diluted by averaging the annotation measurements of all SNPs within a block.…”
Section: Resultsmentioning
confidence: 99%
“…This may explain, at least in part, the relatively modest nature of the improvements observed in our GWAS analysis and the failure of general summaries of eQTL status to contribute meaningfully to the functional signatures. Indeed, many regulatory processes are cell–type–specific [ 11 , 12 ] and hence will be more informative for a given phenotype if measured in the appropriate context [ 14 ]. However, determining the relevant annotation data, assuming it exists, for a given phenotype requires domain expertise and more careful modeling to create functional signatures.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Mendelian traits [50]. Recent approaches have used functional annotation to boost GWAS power in identifying SNP associations [51], stratify heritability of complex disease by functional annotation [52],…”
Section: Discussionmentioning
confidence: 99%
“…Aggregation was done to keep SNPs with the lowest p-value for each LD block and if multiple with the same, the shortest distance to the effect region (to avoid biases towards distal interactions). The Pickrell LD blocks 44 (available from https://bitbucket.org/nygcresearch/ldetect-data) were used for this analysis (Additional file 1: Fig. S2).…”
Section: Calculating Genetic Variant Distance From Epigenetic Effect ...mentioning
confidence: 99%