Background
Genome wide association studies (GWAS) have identified multiple loci associated with epithelial ovarian cancer (EOC) susceptibility, but further progress requires integration of epidemiology and biology to illuminate true risk loci below genome-wide significance levels (P<5×10−8). Most risk SNPs lie within non-protein-encoding regions, and we hypothesize that long non-coding RNA (lncRNA) genes are enriched at EOC risk regions and represent biologically relevant functional targets.
Methods
Using imputed GWAS data from ~18,000 invasive EOC cases and 34,000 controls of European ancestry, the GENCODE (v19) lncRNA database was used to annotate SNPs from 13,442 lncRNAs for permutation-based enrichment analysis. Tumor expression quantitative trait locus (eQTL) analysis was performed for sub-genome-wide regions (1×10−5>P>5×10−8) overlapping lncRNAs.
Results
Of 5,294 EOC-associated SNPs (P<1.0×10−5), 1,464 (28%) mapped within 53 unique lncRNAs and an additional 3,484 (66%) SNPs were correlated (r2>0.2) with SNPs within 115 lncRNAs. EOC-associated SNPs comprised 130 independent regions, of which 72 (55%) overlapped with lncRNAs, representing a significant enrichment (P = 5.0×10−4) that was more pronounced among a subset of 5,401 lncRNAs with active epigenetic regulation in normal ovarian tissue. EOC-associated lncRNAs and their putative promoters and transcription factors were enriched for biologically relevant pathways and eQTL analysis identified five novel putative risk regions with allele-specific effects on lncRNA gene expression.
Conclusion
lncRNAs are significantly enriched at EOC risk regions suggesting a mechanistic role for lncRNAs in driving predisposition to EOC.
Impact
lncRNAs represent key candidates for integrative epidemiological and functional studies. Further research on their biological role in ovarian cancer is indicated.