Breast cancer risk is influenced by rare coding variants in susceptibility genes such as BRCA1 and many common, mainly non-coding variants. However, much of the genetic contribution to breast cancer risk remains unknown. We report results from a genome-wide association study (GWAS) of breast cancer in 122,977 cases and 105,974 controls of European ancestry and 14,068 cases and 13,104 controls of East Asian ancestry1. We identified 65 new loci associated with overall breast cancer at p<5x10-8. The majority of credible risk SNPs in the new loci fall in distal regulatory elements, and by integrating in-silico data to predict target genes in breast cells at each locus, we demonstrate a strong overlap between candidate target genes and somatic driver genes in breast tumours. We also find that heritability of breast cancer due to all SNPs in regulatory features was 2-5-fold enriched relative to the genome-wide average, with strong enrichment for particular transcription factor binding sites. These results provide further insight into genetic susceptibility to breast cancer and will improve the utility of genetic risk scores for individualized screening and prevention.
Genome wide association studies (GWAS) and large scale replication studies have identified common variants in 79 loci associated with breast cancer, explaining ~14% of the familial risk of the disease. To identify new susceptibility loci, we performed a meta-analysis of 11 GWAS comprising of 15,748 breast cancer cases and 18,084 controls, and 46,785 cases and 42,892 controls from 41 studies genotyped on a 200K custom array (iCOGS). Analyses were restricted to women of European ancestry. Genotypes for more than 11M SNPs were generated by imputation using the 1000 Genomes Project reference panel. We identified 15 novel loci associated with breast cancer at P<5×10−8. Combining association analysis with ChIP-Seq data in mammary cell lines and ChIA-PET chromatin interaction data in ENCODE, we identified likely target genes in two regions: SETBP1 on 18q12.3 and RNF115 and PDZK1 on 1q21.1. One association appears to be driven by an amino-acid substitution in EXO1.
We analyzed 3,872 common genetic variants across the
ESR1 locus (encoding estrogen receptor α) in
118,816 subjects from three international consortia. We found evidence for at
least five independent causal variants, each associated with different phenotype
sets, including estrogen receptor (ER+ or
ER−) and human ERBB2 (HER2+ or
HER2−) tumor subtypes, mammographic density and tumor
grade. The best candidate causal variants for ER− tumors lie
in four separate enhancer elements, and their risk alleles reduce expression of
ESR1, RMND1 and CCDC170,
whereas the risk alleles of the strongest candidates for the remaining
independent causal variant disrupt a silencer element and putatively increase
ESR1 and RMND1 expression.
In a three-stage genome-wide association study among East Asian women including 22,780 cases and 24,181 controls, we identified three novel genetic loci associated with breast cancer risk, including rs4951011 at 1q32.1 (in intron 2 of the ZC3H11A gene, P = 8.82 × 10−9), rs10474352 at 5q14.3 (near the ARRDC3 gene, P = 1.67 × 10−9), and rs2290203 at 15q26.1 (in intron 14 of the PRC1 gene, P = 4.25 × 10−8). These associations were replicated in European-ancestry populations including 16,003 cases and 41,335 controls (P = 0.030, 0.004, and 0.010, respectively). Data from the ENCODE project suggest that variants rs4951011 and rs10474352 may be located in an enhancer region and transcription factor binding sites, respectively. This study provides additional insights into the genetics and biology of breast cancer.
Genome-wide association studies (GWASs) have revealed increased breast cancer risk associated with multiple genetic variants at 5p12. Here, we report the fine mapping of this locus using data from 104,660 subjects from 50 case-control studies in the Breast Cancer Association Consortium (BCAC). With data for 3,365 genotyped and imputed SNPs across a 1 Mb region (positions 44,394,495-45,364,167; NCBI build 37), we found evidence for at least three independent signals: the strongest signal, consisting of a single SNP rs10941679, was associated with risk of estrogen-receptor-positive (ER) breast cancer (per-g allele OR ER = 1.15; 95% CI 1.13-1.18; p = 8.35 × 10). After adjustment for rs10941679, we detected signal 2, consisting of 38 SNPs more strongly associated with ER-negative (ER) breast cancer (lead SNP rs6864776: per-a allele OR ER = 1.10; 95% CI 1.05-1.14; p conditional = 1.44 × 10), and a single signal 3 SNP (rs200229088: per-t allele OR ER = 1.12; 95% CI 1.09-1.15; p conditional = 1.12 × 10). Expression quantitative trait locus analysis in normal breast tissues and breast tumors showed that the g (risk) allele of rs10941679 was associated with increased expression of FGF10 and MRPS30. Functional assays demonstrated that SNP rs10941679 maps to an enhancer element that physically interacts with the FGF10 and MRPS30 promoter regions in breast cancer cell lines. FGF10 is an oncogene that binds to FGFR2 and is overexpressed in ∼10% of human breast cancers, whereas MRPS30 plays a key role in apoptosis. These data suggest that the strongest signal of association at 5p12 is mediated through coordinated activation of FGF10 and MRPS30, two candidate genes for breast cancer pathogenesis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.