Triple-negative (TN) breast cancer is an aggressive subtype of breast cancer associated with a unique set of epidemiologic and genetic risk factors. We conducted a two-stage genome-wide association study of TN breast cancer (stage 1: 1529 TN cases, 3399 controls; stage 2: 2148 cases, 1309 controls) to identify loci that influence TN breast cancer risk. Variants in the 19p13.1 and PTHLH loci showed genome-wide significant associations (P < 5 × 10(-) (8)) in stage 1 and 2 combined. Results also suggested a substantial enrichment of significantly associated variants among the single nucleotide polymorphisms (SNPs) analyzed in stage 2. Variants from 25 of 74 known breast cancer susceptibility loci were also associated with risk of TN breast cancer (P < 0.05). Associations with TN breast cancer were confirmed for 10 loci (LGR6, MDM4, CASP8, 2q35, 2p24.1, TERT-rs10069690, ESR1, TOX3, 19p13.1, RALY), and we identified associations with TN breast cancer for 15 additional breast cancer loci (P < 0.05: PEX14, 2q24.1, 2q31.1, ADAM29, EBF1, TCF7L2, 11q13.1, 11q24.3, 12p13.1, PTHLH, NTN4, 12q24, BRCA2, RAD51L1-rs2588809, MKL1). Further, two SNPs independent of previously reported signals in ESR1 [rs12525163 odds ratio (OR) = 1.15, P = 4.9 × 10(-) (4)] and 19p13.1 (rs1864112 OR = 0.84, P = 1.8 × 10(-) (9)) were associated with TN breast cancer. A polygenic risk score (PRS) for TN breast cancer based on known breast cancer risk variants showed a 4-fold difference in risk between the highest and lowest PRS quintiles (OR = 4.03, 95% confidence interval 3.46-4.70, P = 4.8 × 10(-) (69)). This translates to an absolute risk for TN breast cancer ranging from 0.8% to 3.4%, suggesting that genetic variation may be used for TN breast cancer risk prediction.
Metastatic recurrence is the leading cause of cancer death in patients with colorectal carcinoma. In order to capture the molecular underpinnings for metastasis and tumor progression, we performed integrative network analysis on 11 independent human colorectal cancer gene expression data sets and applied expression data from an immunocompetent mouse model of metastasis as an additional filter for this biological process. In silico analysis of one metastasis-related co-expression module predicted Nuclear Factor of Activated T-cell (NFAT) transcription factors as potential regulators for the module. Cells selected for invasiveness and metastatic capability expressed higher levels of NFATc1 as compared with poorly metastatic and less invasive parental cells. We found that inhibition of NFATc1 in human and mouse colon cancer cells resulted in decreased invasiveness in culture and down-regulation of metastasis-related network genes. Overexpression of NFATc1 significantly increased the metastatic potential of colon cancer cells while inhibition of NFATc1 reduced metastasis growth in an immunocompetent mouse model. Finally, we found that an 8-gene signature comprising genes up-regulated by NFATc1 significantly correlated with worse clinical outcomes in Stage II and III colorectal cancer patients. Thus, NFATc1 regulates colon cancer cell behavior and its transcriptional targets constitute a novel, biologically-anchored gene expression signature for the identification of colon cancers with high risk of metastatic recurrence.
Modification of proteins by reactive electrophiles such as the 4-hydroxy-2-nonenal (HNE) plays a critical role in oxidant-associated human diseases. However, little is known about protein adduction and the mechanism by which protein damage elicits adaptive effects and toxicity. We developed a systems approach for relating protein adduction to gene expression changes through the integration of protein adduction, gene expression, protein-DNA interaction, and protein-protein interaction data. Using a random walk strategy, we expanded a list of responsive transcription factors inferred from gene expression studies to upstream signaling networks, which in turn allowed overlaying protein adduction data on the network for the prediction of stress sensors and their associated regulatory mechanisms. We demonstrated the general applicability of transcription factor-based signaling network inference using 103 known pathways. Applying our workflow on gene expression and protein adduction data from HNE-treatment not only rediscovered known mechanisms of electrophile stress but also generated novel hypotheses regarding protein damage sensors. Although developed for analyzing protein adduction data, the framework can be easily adapted for phosphoproteomics and other types of protein modification data.
Driver somatic mutations are a hallmark of a tumor that can be used for diagnosis and targeted therapy. Mutations are primarily detected from tumor DNA. As dynamic molecules of gene activities, transcriptome profiling by RNA sequence (RNA-seq) is becoming increasingly popular, which not only measures gene expression but also structural variations such as mutations and fusion transcripts. Although single-nucleotide variants (SNVs) can be easily identified from RNA-seq, intermediate long insertions/deletions (indels > 2 bases and less than sequence reads) cause significant challenges and are ignored by most RNA-seq analysis tools. This study evaluates commonly used RNA-seq analysis programs along with variant and somatic mutation callers in a series of data sets with simulated and known indels. The aim is to develop strategies for accurate indel detection. Our results show that the RNA-seq alignment is the most important step for indel identification and the evaluated programs have a wide range of sensitivity to map sequence reads with indels, from not at all to decently sensitive. The sensitivity is impacted by sequence read lengths. Most variant calling programs rely on hard evidence indels marked in the alignment and the programs with realignment may use soft-clipped reads for indel inferencing. Based on the observations, we have provided practical recommendations for indel detection when different RNA-seq aligners are used and demonstrated the best option with highly reliable results. With careful customization of bioinformatics algorithms, RNA-seq can be reliably used for both SNV and indel mutation detection that can be used for clinical decision-making.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.