3Myeloproliferative neoplasms (MPNs) are blood cancers characterized by excessive production of mature myeloid cells that result from the acquisition of somatic driver mutations in hematopoietic stem cells (HSCs) 1 . While substantial progress has been made to define the causal somatic mutation profile for MPNs 2 , epidemiologic studies indicate a significant heritable component for the disease that is among the highest known for all cancers 3 . However, only a limited set of genetic risk loci have been identified, and the underlying biological mechanisms leading to MPN acquisition remain unexplained. Here, to define the inherited risk profile, we conducted the largest genome-wide association study of MPNs to date (978,913 individuals with 3,224 cases) and identified 14 genome-wide significant loci, as well as a polygenic signature that increases the odds for disease acquisition by nearly 3-fold between the top and median deciles. Interestingly, we find a shared genetic architecture between MPN risk and several hematopoietic traits spanning distinct lineages, as well as an association between increased MPN risk and longer leukocyte telomere length, collectively implicating HSC function and self-renewal. Strikingly, we find a significant enrichment for risk variants mapping to accessible chromatin in HSCs compared with other hematopoietic populations. Finally, gene mapping identifies modulators of HSC biology and targeted variant-to-function analyses suggest likely roles for CHEK2 and GFI1B in altering HSC function to confer disease risk. Overall, we demonstrate the power of human genetic studies to illuminate a previously unappreciated mechanism for MPN risk through modulation of HSC function.
4
Main Text:We initially sought to characterize the germline genetic architecture that confers risk for MPNs, and therefore conducted a genome-wide association study (GWAS) meta-analysis using three population-based cohorts (UK Biobank (UKBB), 23andMe, and FinnGen). The combined sample size comprised 2,627 MPN cases and 755,476 controls, more than doubling the number of cases from prior studies (Supplementary Table 1, Extended Data Fig. 1). We tested 7,329,649 well-represented variants passing central and studyspecific quality control measures. Linkage disequilibrium score regression (LDSC) 4 showed negligible inflation in test statistics due to population structure, with an intercept of 1.005 and genomic control factor of 1.0255. We estimate the narrow-sense heritability for MPN risk to be 0.0717 (s.e. = 0.0306) on the liability scale, suggesting that ~7% of variance in MPN risk can be attributed to common genetic variation (MAF > 1%), assuming a disease prevalence of 0.0328%, as reported in population studies 5,6 .Analysis of GWAS signals using GCTA 7 revealed 12 linkage disequilibrium (LD)independent loci at genome-wide significance (p < 5 x 10 -8 ) and an additional 16 loci with suggestive associations (p < 1 x 10 -6 ) ( Fig. 1a, Supplementary Table 2). Previous studies have shown that carriers of the somatic JAK2 V617...