Background: To reduce lung cancer burden in the US, a better understanding of biological mechanisms in early disease development could provide new opportunities for risk stratification.
Methods: In a nested case-control study, we measured blood leukocyte DNA methylation levels in pre-diagnostic samples collected from 430 men and women in the 1989 CLUE II cohort. Median time from blood drawn to diagnosis was 14 years for all participants. We compared DNA methylation levels by case/control status to identify novel genomic regions, both single CpG sites and differentially methylated regions (DMRs), while controlling for known DNA methylation changes associated with smoking using a previously described pack-years based smoking methylation score. Stratification analyses were conducted by time from blood draw to diagnosis, histology, and smoking status.
Results: We identified sixteen single CpG sites and forty DMRs significantly associated with lung cancer risk (q < 0.05). The identified genomic regions were associated with genes including H19, HOXA4, RUNX3, BRICD5, PLXNB2, and RP13. For the single CpG sites, the strongest association was noted for cg09736286 in the DIABLO gene (OR [for 1 SD] = 2.99, 95% CI: 1.95-4.59, P-value = 4.81 x 10-7). For the DMRs, we found that CpG sites in the HOXA4 region were hypermethylated in cases compared to controls.
Conclusion: The single CpG sites and DMRs that we identified represented significant measurable differences in lung cancer risk, providing new insights into the biological processes of early lung cancer development and potential biomarkers for lung cancer risk stratification.