Aim: Currently, the obesity epidemic is one of the biggest problems for human health. Obesity is impacted on survival in patients with breast cancer. However, key biomarkers of obesity-related breast cancer risk are still not well known. Thus, using machine learning to identify the most appropriate features in obesity-associated breast cancer patients may improve the predictive accuracy and interpretability of regression models. Methods: In the present study, we identified 23 differentially expressed genes (DEGs) from the GSE24185 transcriptome dataset. Seed genes were identified from DEGs, the co-expression network genes and hub genes of the protein-protein interaction network. Pathway enrichment analysis was performed for DEGs. The Ridge penalty regression model was executed by using P-values of enriched pathways and seed gene pathway association score to obtain the most relevant molecular signatures. The model was performed using 10-fold cross-validation to fit the penalized models. Results: Angiotensin II receptor type 1 (AGTR1), cyclin D1 (CCND1), glutamate ionotropic receptor AMPA type subunit 2 (GRIA2), interleukin-6 cytokine family signal transducer (IL6ST), matrix metallopeptidase 9 (MMP9), and protein kinase CAMP-dependent type II regulatory subunit beta (PRKAR2B) were considered as candidate molecular signatures of obese patients with breast cancer. In addition, RAF-independent MAPK1/3 activation, collagen degradation, bladder cancer, drug metabolism-cytochrome P450, and signaling by Hedgehog pathways in cancer were primarily associated with obesity-associated breast cancer. Conclusion: These genes may be used for risk analysis of the disease progression of obese patients with breast cancer. Corresponding genes and pathways should be validated via experimental studies.
Differential expressions of certain genes during tumorigenesis may serve to identify novel manageable targets in the clinic. In this work with an integrated bioinformatics approach, we analyzed public microarray datasets from Gene Expression Omnibus (GEO) to explore the key differentially expressed genes (DEGs) in non-small cell lung cancer (NSCLC). We identified a total of 984 common DEGs in 252 healthy and 254 NSCLC gene expression samples. The top 10 DEGs as a result of pathway enrichment and protein–protein interaction analysis were further investigated for their prognostic performances. Among these, we identified high expressions of CDC20, AURKA, CDK1, EZH2, and CDKN2A genes that were associated with significantly poorer overall survival in NSCLC patients. On the contrary, high mRNA expressions of CBL, FYN, LRKK2, and SOCS2 were associated with a significantly better prognosis. Furthermore, our drug target analysis for these hub genes suggests a potential use of Trichostatin A, Pracinostat, TGX-221, PHA-793887, AG-879, and IMD0354 antineoplastic agents to reverse the expression of these DEGs in NSCLC patients.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.