BackgroundTargeted therapies based on the molecular and histological features of cancer types are becoming standard practice. The most effective regimen in lung cancers is different between squamous cell carcinoma (SCC) and adenocarcinoma (AD). Therefore a precise diagnosis is crucial, but this has been difficult, particularly for poorly differentiated SCC (PDSCC) and AD without a lepidic growth component (non-lepidic AD). Biomarkers enabling a precise diagnosis are therefore urgently needed.MethodsCap Analysis of Gene Expression (CAGE) is a method used to quantify promoter activities across the whole genome by determining the 5’ ends of capped RNA molecules with next-generation sequencing. We performed CAGE on 97 frozen tissues from surgically resected lung cancers (22 SCC and 75 AD), and confirmed the findings by immunohistochemical analysis (IHC) in an independent group (29 SCC and 45 AD).ResultsUsing the genome-wide promoter activity profiles, we confirmed that the expression of known molecular markers used in IHC for SCC (CK5, CK6, p40 and desmoglein-3) and AD (TTF-1 and napsin A) were different between SCC and AD. We identified two novel marker candidates, SPATS2 for SCC and ST6GALNAC1 for AD, as showing comparable performance and complementary utility to the known markers in discriminating PDSCC and non-lepidic AD. We subsequently confirmed their utility at the protein level by IHC in an independent group.ConclusionsWe identified two genes, SPATS2 and ST6GALNAC1, as novel complemental biomarkers discriminating SCC and AD. These findings will contribute to a more accurate diagnosis of NSCLC, which is crucial for precision medicine for lung cancer.Electronic supplementary materialThe online version of this article (doi:10.1186/s12885-016-2792-1) contains supplementary material, which is available to authorized users.