“…We queried Gene Expression Omnibus (GEO) with the keyword “lung” and with the platform set to “GPL570” or “GPL96,” and then selected datasets with a sample size of more than 30. As a result, we got six datasets on platform GPL570, including GSE10245 (n = 40), 34 GSE19188 (n = 110), 35 GSE30219 (n = 83), 36 GSE31210 (n = 224), 37,38 GSE37745 (n = 91), 39 and GSE50081 (n = 128), 40 and five datasets on platform GPL96, including GSE10072 (n = 107), 41 GSE14814 (n = 71), 42 GSE31547 (n = 50), 43 GSE68465 (n = 353), 44 and GSE7670 (n = 54). 45 Detailed information about the datasets is presented in Table 1; the sample numbers are the result of removing patients treated with chemotherapy or radiotherapy.…”