Breast cancer exhibits familial aggregation, consistent with variation in genetic susceptibility to the disease. Known susceptibility genes account for less than 25% of the familial risk of breast cancer, and the residual genetic variance is likely to be due to variants conferring more moderate risks. To identify further susceptibility alleles, we conducted a two-stage genome-wide association study in 4,398 breast cancer cases and 4,316 controls, followed by a third stage in which 30 single nucleotide polymorphisms (SNPs) were tested for confirmation in 21,860 cases and 22,578 controls from 22 studies. We used 227,876 SNPs that were estimated to correlate with 77% of known common SNPs in Europeans at r2 > 0.5. SNPs in five novel independent loci exhibited strong and consistent evidence of association with breast cancer (P < 10(-7)). Four of these contain plausible causative genes (FGFR2, TNRC9, MAP3K1 and LSP1). At the second stage, 1,792 SNPs were significant at the P < 0.05 level compared with an estimated 1,343 that would be expected by chance, indicating that many additional common susceptibility alleles may be identifiable by this approach.
The vast majority of coding variants are rare, and assessment of the contribution of rare variants to complex traits is hampered by low statistical power and limited functional data. Improved methods for predicting the pathogenicity of rare coding variants are needed to facilitate the discovery of disease variants from exome sequencing studies. We developed REVEL (rare exome variant ensemble learner), an ensemble method for predicting the pathogenicity of missense variants on the basis of individual tools: MutPred, FATHMM, VEST, PolyPhen, SIFT, PROVEAN, MutationAssessor, MutationTaster, LRT, GERP, SiPhy, phyloP, and phastCons. REVEL was trained with recently discovered pathogenic and rare neutral missense variants, excluding those previously used to train its constituent tools. When applied to two independent test sets, REVEL had the best overall performance (p < 10) as compared to any individual tool and seven ensemble methods: MetaSVM, MetaLR, KGGSeq, Condel, CADD, DANN, and Eigen. Importantly, REVEL also had the best performance for distinguishing pathogenic from rare neutral variants with allele frequencies <0.5%. The area under the receiver operating characteristic curve (AUC) for REVEL was 0.046-0.182 higher in an independent test set of 935 recent SwissVar disease variants and 123,935 putatively neutral exome sequencing variants and 0.027-0.143 higher in an independent test set of 1,953 pathogenic and 2,406 benign variants recently reported in ClinVar than the AUCs for other ensemble methods. We provide pre-computed REVEL scores for all possible human missense variants to facilitate the identification of pathogenic variants in the sea of rare variants discovered as sequencing studies expand in scale.
BACKGROUND A high body-mass index (BMI, the weight in kilograms divided by the square of the height in meters) is associated with increased mortality from cardiovascular disease and certain cancers, but the precise relationship between BMI and all-cause mortality remains uncertain. METHODS We used Cox regression to estimate hazard ratios and 95% confidence intervals for an association between BMI and all-cause mortality, adjusting for age, study, physical activity, alcohol consumption, education, and marital status in pooled data from 19 prospective studies encompassing 1.46 million white adults, 19 to 84 years of age (median, 58). RESULTS The median baseline BMI was 26.2. During a median follow-up period of 10 years (range, 5 to 28), 160,087 deaths were identified. Among healthy participants who never smoked, there was a J-shaped relationship between BMI and all-cause mortality. With a BMI of 22.5 to 24.9 as the reference category, hazard ratios among women were 1.47 (95 percent confidence interval [CI], 1.33 to 1.62) for a BMI of 15.0 to 18.4; 1.14 (95% CI, 1.07 to 1.22) for a BMI of 18.5 to 19.9; 1.00 (95% CI, 0.96 to 1.04) for a BMI of 20.0 to 22.4; 1.13 (95% CI, 1.09 to 1.17) for a BMI of 25.0 to 29.9; 1.44 (95% CI, 1.38 to 1.50) for a BMI of 30.0 to 34.9; 1.88 (95% CI, 1.77 to 2.00) for a BMI of 35.0 to 39.9; and 2.51 (95% CI, 2.30 to 2.73) for a BMI of 40.0 to 49.9. In general, the hazard ratios for the men were similar. Hazard ratios for a BMI below 20.0 were attenuated with longer-term follow-up. CONCLUSIONS In white adults, overweight and obesity (and possibly underweight) are associated with increased all-cause mortality. All-cause mortality is generally lowest with a BMI of 20.0 to 24.9.
Breast cancer risk is influenced by rare coding variants in susceptibility genes such as BRCA1 and many common, mainly non-coding variants. However, much of the genetic contribution to breast cancer risk remains unknown. We report results from a genome-wide association study (GWAS) of breast cancer in 122,977 cases and 105,974 controls of European ancestry and 14,068 cases and 13,104 controls of East Asian ancestry1. We identified 65 new loci associated with overall breast cancer at p<5x10-8. The majority of credible risk SNPs in the new loci fall in distal regulatory elements, and by integrating in-silico data to predict target genes in breast cells at each locus, we demonstrate a strong overlap between candidate target genes and somatic driver genes in breast tumours. We also find that heritability of breast cancer due to all SNPs in regulatory features was 2-5-fold enriched relative to the genome-wide average, with strong enrichment for particular transcription factor binding sites. These results provide further insight into genetic susceptibility to breast cancer and will improve the utility of genetic risk scores for individualized screening and prevention.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.