BackgroundLung cancer is a good example of the potential benefit of symptom-based diagnosis, as it is the commonest cancer worldwide, with the highest mortality from late diagnosis and poor symptom recognition. The diagnosis and risk assessment tools currently available have been shown to require further validation. In this study, we determine the symptoms associated with lung cancer prior to diagnosis and demonstrate that by separating prior risk based on factors such as smoking history and age, from presenting symptoms and combining them at the individual patient level, we can make greater use of this knowledge to create a practical framework for the symptomatic diagnosis of individual patients presenting in primary care.AimTo provide an evidence-based analysis of symptoms observed in lung cancer patients prior to diagnosis.Design and settingSystematic review and meta-analysis of primary and secondary care data.MethodSeven databases were searched (MEDLINE, Embase, Cumulative Index to Nursing and Allied Health Literature, Health Management Information Consortium, Web of Science, British Nursing Index and Cochrane Library). Thirteen studies were selected based on predetermined eligibility and quality criteria for diagnostic assessment to establish the value of symptom-based diagnosis using diagnosistic odds ratio (DOR) and summary receiver operating characteristic (SROC) curve. In addition, routinely collated real-time data from primary care electronic health records (EHR), TransHis, was analysed to compare with our findings.ResultsHaemoptysis was found to have the greatest diagnostic value for lung cancer, diagnostic odds ratio (DOR) 6.39 (3.32–12.28), followed by dyspnoea 2.73 (1.54–4.85) then cough 2.64 (1.24–5.64) and lastly chest pain 2.02 (0.88–4.60). The use of symptom-based diagnosis to accurately diagnose lung cancer cases from non-cases was determined using the summary receiver operating characteristic (SROC) curve, the area under the curve (AUC) was consistently above 0.6 for each of the symptoms described, indicating reasonable discriminatory power. The positive predictive value (PPV) of diagnostic symptoms depends on an individual’s prior risk of lung cancer, as well as their presenting symptom pattern. For at risk individuals we calculated prior risk using validated epidemiological models for risk factors such as age and smoking history, then combined with the calculated likelihood ratios for each symptom to establish posterior risk or positive predictive value (PPV).ConclusionOur findings show that there is diagnostic value in the clinical symptoms associated with lung cancer and the potential benefit of characterising these symptoms using routine data studies to identify high-risk patients.