This study aimed to identify the oncogenes associated with lung cancer based on the mRNA and single nucleotide polymorphism (SNP) profile data. The mRNA expression profile data of GSE43458 (80 cancer and 30 normal samples) and SNP profile data of GSE33355 (61 pairs of lung cancer samples and control samples) were downloaded from Gene Expression Omnibus database. Common genes between the mRNA profile and SNP profile were identified as the lung cancer oncogenes. Risk subpathways of the selected oncogenes with the SNP locus were analyzed using the iSubpathwayMiner package in R. Moreover, protein-protein interaction (PPI) network of the oncogenes was constructed using the HPRD database and then visualized using the Cytoscape. Totally, 3004 DEGs (1105 up-regulated and 1899 down-regulated) and 125 significant SNPs closely related to 174 genes in the lung cancer samples were identified. Also, 39 common genes, like PFKP (phosphofructokinase, platelet) and DGKH-rs11616202 (diacylglycerol kinase, eta) that enriched in sub-pathways such as galactose metabolism, fructose and mannose metabolism, and pentose phosphate pathway, were identified as the lung cancer oncogenes. Besides, PIK3R1 (phosphoinositide-3-kinase, regulatory subunit 1), RORA (RAR-related orphan receptor A), MAGI3 (membrane associated guanylate kinase, WW and PDZ domain containing 3), PTPRM (protein tyrosine phosphatase, receptor type, M), and BMP6 (bone morphogenetic protein 6) were the hub genes in PPI network. Our study suggested that PFKP and DGKH that enriched in galactose metabolism, fructose and mannose metabolism pathway, as well as PIK3R1, RORA, and MAGI3, may be the lung cancer oncogenes.
Key words: lung cancer, single nucleotide polymorphism (SNP), function analysis, differentially expressed gene, oncogenesAbbreviations: SNP -single nucleotide polymorphism; PPI -proteinprotein interaction; DEGs -differentially expressed genes Lung cancer is one of the most common malignancies with an increasing morbidity and mortality and is a worldwide leading cause of cancer-related death with a 5-year survival rate from 13% to 15% [1]. Mechanism of lung cancer is complicate, mainly due to the late diagnosis and lack of effective treatment [2]. Therefore, underlying the molecular profiles of lung cancer as well as elucidating the roles of oncogenes and tumor suppressors in the development of this malignancy is expected to identify the molecular targets for lung cancer prediction and treatment.Previous studies have demonstrated that environmental factors like smoking and air pollution, and gene polymorphisms were the main factors contributing to the lung cancer progression and metastasis [3]. Chemical constituents in tobacco smoke including compounds of the carcinogenic polycyclic type were the carcinogen for lung cancer [4]. A previous study reveals that mutations of the tumor suppressor genes are a main reason for lung cancer progression, such as the prevalence of p53 mutational patterns G to T transversions is 30% and p53 mutations in lung cancer can be attribu...