Introduction: Variables, such as smoking and obesity, are rarely available in administrative databases. We explored the added value of including these data in an administrative database study evaluating the association of statin use with survival in kidney cancer.
Methods:We linked administrative data with chart-abstracted data on smoking and obesity for 808 patients undergoing nephrectomy for kidney cancer. Base models consisted of variables from administrative databases (age, sex, year of surgery, and different measures of comorbidity [to compare their sensitivity to smoking and obesity data]); extended models added chart-abstracted data. We compared coefficients for statin use with overall (OS) and cancerspecific survival (CSS), and used the c-statistic and net reclassification improvement (NRI) to compare predications of five-year survival obtained from Cox proportional hazard models. Results: The coefficient for statin use changed minimally following addition of abstracted data (<6% for OS, <2% for CSS).
Conclusions:The inclusion of data on smoking and obesity marginally influences survival models in kidney cancer studies using administrative data.
IntroductionAdministrative databases are being increasingly used in clinical research over institutional clinical databases, as they provide large sample sizes, have strong external generalizability, and can have comprehensive information on followup;1 however, variables such as smoking and obesity are rarely available in administrative databases, but could be available in institutional databases from chart review. Studies using administrative data often note this as a limitation 2,3 and a measure of comorbidity is often used to account for smoking and obesity, since these factors influence various aspects of health and may relate to a person's overall health status. 4 It remains unknown whether the addition of smoking and obesity data would markedly improve risk prediction compared to a model using a measure of comorbidity derived from administrative data. If including these variables is important for risk prediction, researchers should make efforts to obtain these data to improve the reliability of their results. Conversely, if little value is added, this additional exercise may not be worthwhile, given the costs and time associated with the process.We explore this concept using as an example a cohort study evaluating the association of statin use with kidney cancer survival. Statins are commonly prescribed lipidlowering medications that have recently gained interest in the oncology community based on studies showing that their use is associated with improved survival outcomes in various malignancies. 5,6 In a large, population-based cohort study using administrative data, we recently demonstrated that statin use at the time of diagnosis was not significantly associated with cancer-specific (CSS) or overall survival (OS) in kidney cancer patients (Nayan et al. Manuscript in progress). We used a comorbidity score for risk adjustment, but we were unable ...