Lung cancer is one of the most widespread neoplasms worldwide. To identify the key biomarkers in its carcinogenesis and development, the mRNA microarray datasets GSE102287, GSE89047, GSE67061 and GSE74706 were obtained from the Gene Expression Omnibus database. GEO2R was used to identify the differentially expressed genes (DEGs) in lung cancer. The Database for Annotation, Visualization and Integrated Discovery was used to analyze the functions and pathways of the DEGs, while the Search Tool for the Retrieval of Interacting Genes/Proteins and Cytoscape were used to obtain the protein-protein interaction (PPI) network. Kaplan Meier curves were used to analyze the effect of the hub genes on overall survival (OS). Module analysis was completed using Molecular Complex Detection in Cytoscape, and one co-expression network of these significant genes was obtained with cBioPortal. A total of 552 DEGs were identified among the four microarray datasets, which were mainly enriched in ‘cell proliferation’, ‘cell growth’, ‘cell division’, ‘angiogenesis’ and ‘mitotic nuclear division’. A PPI network, composed of 44 nodes and 886 edges, was constructed, and its significant module had 16 hub genes in the whole network: Opa interacting protein 5, exonuclease 1, PCNA clamp-associated factor, checkpoint kinase 1, hyaluronan-mediated motility receptor, maternal embryonic leucine zipper kinase, non-SMC condensin I complex subunit G, centromere protein F, BUB1 mitotic checkpoint serine/threonine kinase, cyclin A2, thyroid hormone receptor interactor 13, TPX2 microtubule nucleation factor, nucleolar and spindle associated protein 1, kinesin family member 20A, aurora kinase A and centrosomal protein 55. Survival analysis of these hub genes revealed that they were markedly associated with poor OS in patients with lung cancer. In summary, the hub genes and DEGs delineated in the research may aid the identification of potential targets for diagnostic and therapeutic strategies in lung cancer.