The 331 million people of the United States are served by a complex and expensive healthcare system that accounts for nearly 18% of the country's gross domestic product. Over 90% of patients are insured by private or government-funded plans, but despite high coverage and unusually high healthcare spending, vast disparities exist within the United States population based on demographics in terms of diagnosis, treatment, and outcomes of disease. Thoracic surgeons in the United States are trained to treat patients with diseases of the chest in the operative and perioperative settings, and can accomplish this training through multiple highly competitive pathways. Thoracic surgeons perform an average of 135 operations each per year which address diseases of the lungs, trachea, esophagus, chest wall, mediastinum, and diaphragm.Video assisted thoracoscopic surgeries are the most commonly performed procedures, which are primarily completed to treat lung cancer. Lung cancer is the deadliest and second most prevalent malignancy in the United States, with over 200,000 new cases expected this year. In addition to encouragement of smoking cessation and more attention to air pollutants, increased access to lung cancer screening has significantly expedited diagnosis and reduced mortality from lung cancer in the last several years. Thoracic surgeons in the United States are tasked with treating common yet highly morbid diseases of the chest in a patient population that is diverse in terms of race, socioeconomic status, and healthcare insurance coverage. As the population ages and a shortage of thoracic surgeons looms, the importance of early diagnosis, skillful surgical management, and attention to the disparities that exist in our system cannot be overstated.
Background
We aim to develop and test performance of a semi-automated method (computerized query combined with manual review) for chart abstraction in the identification and characterization of surveillance radiology imaging for post-treatment non-small cell lung cancer patients.
Methods
A gold standard dataset consisting of 3011 radiology reports from 361 lung cancer patients treated at the Veterans Health Administration from 2008 to 2016 was manually created by an abstractor coding image type, image indication, and image findings. Computerized queries using a text search tool were performed to code reports. The primary endpoint of query performance was evaluated by sensitivity, positive predictive value (PPV), and F1 score. The secondary endpoint of efficiency compared semi-automated abstraction time to manual abstraction time using a separate dataset and the Wilcoxon rank-sum test.
Results
Query for image type demonstrated the highest sensitivity of 85%, PPV 95%, and F1 score 0.90. Query for image indication demonstrated sensitivity 72%, PPV 70%, and F1 score 0.71. The image findings queries ranged from sensitivity 75–85%, PPV 23–25%, and F1 score 0.36–0.37. Semi-automated abstraction with our best performing query (image type) improved abstraction times by 68% per patient compared to manual abstraction alone (from median 21.5 min (interquartile range 16.0) to 6.9 min (interquartile range 9.5), p < 0.005).
Conclusions
Semi-automated abstraction using the best performing query of image type improved abstraction efficiency while preserving data accuracy. The computerized query acts as a pre-processing tool for manual abstraction by restricting effort to relevant images. Determining image indication and findings requires the addition of manual review for a semi-automatic abstraction approach in order to ensure data accuracy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.