2003
DOI: 10.1073/pnas.1530509100
|View full text |Cite
|
Sign up to set email alerts
|

Statistical significance for genomewide studies

Abstract: With the increase in genomewide experiments and the sequencing of multiple genomes, the analysis of large data sets has become commonplace in biology. It is often the case that thousands of features in a genomewide data set are tested against some null hypothesis, where a number of features are expected to be significant. Here we propose an approach to measuring statistical significance in these genomewide studies based on the concept of the false discovery rate. This approach offers a sensible balance between… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

35
8,449
1
12

Year Published

2007
2007
2022
2022

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 8,789 publications
(8,497 citation statements)
references
References 24 publications
35
8,449
1
12
Order By: Relevance
“…The patterns of sharing were also supported by replication between single-tissue cis -eQTLs, estimated by π 1 (the proportion of true positives 16 ) among the eQTLs identified in one tissue and then tested for replication in a second tissue (Extended Data Fig. 7a, median π 1 = 0.740).…”
Section: Tissue-sharing and Specificity Of Eqtlsmentioning
confidence: 75%
See 2 more Smart Citations
“…The patterns of sharing were also supported by replication between single-tissue cis -eQTLs, estimated by π 1 (the proportion of true positives 16 ) among the eQTLs identified in one tissue and then tested for replication in a second tissue (Extended Data Fig. 7a, median π 1 = 0.740).…”
Section: Tissue-sharing and Specificity Of Eqtlsmentioning
confidence: 75%
“…The significance of the most highly associated variant per gene was determined from empirical P values, extrapolated from a Beta distribution fitted to adaptive permutations with the setting –permute 1000 10000. These empirical P values were subsequently corrected for multiple testing across genes using Storey’s q value method 16 . To identify the list of all significant variant–gene pairs associated with eGenes, variants with a nominal P value below the gene-level threshold were considered significant and included in the final list of variant–gene pairs.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…To estimate the evolutionary rate, standard branch models calculated with the CODEML program of PAML 4.6 (Yang, 2007) were used, and an adjusted Chi‐square test (Storey & Tibshirani, 2003) was applied for testing p values.…”
Section: Methodsmentioning
confidence: 99%
“…Q-values were computed by the q-value package of R language and were then used to estimate the false discovery rate (FDR) threshold for the significant p-value of 0.05 [13, 14]. Confidence regions were determined by log of odds ratio (LOD) dropoff 2 from the peak SNPs.…”
Section: Methodsmentioning
confidence: 99%