Background: The timeliness of detection of a sepsis incidence in progress is a crucial factor in the outcome for the patient. Machine learning models built from data in electronic health records can be used as an effective tool for improving this timeliness, but so far the potential for clinical implementations has been largely limited to studies in intensive care units. This study will employ a richer data set that will expand the applicability of these models beyond intensive care units. Furthermore, we will circumvent several important limitations that have been found in the literature: 1) Models are evaluated shortly before sepsis onset without considering interventions already initiated. 2) Machine learning models are built on a restricted set of clinical parameters, which are not necessarily measured in all departments. 3) Model performance is limited by current knowledge of sepsis, as feature interactions and time dependencies are hard-coded into the model. Methods: In this study, we present a model to overcome these shortcomings using a deep learning approach on a diverse multicenter data set. We used retrospective data from multiple Danish hospitals over a seven-year period. Our sepsis detection system is constructed as a combination of a convolutional neural network and a long short-term memory network. We suggest a retrospective assessment of interventions by looking at intravenous antibiotics and blood cultures preceding the prediction time. Results: Results show performance ranging from AUROC 0.856 (3 hours before sepsis onset) to AUROC 0.756 (24 hours before sepsis onset). Evaluating the clinical utility of the model, we find that a large proportion of septic patients did not receive antibiotic treatment or blood culture at the time of the sepsis prediction, and the model could therefore facilitate such interventions at an earlier point in time. Conclusion: We present a deep learning system for early detection of sepsis that is able to learn characteristics of the key factors and interactions from the raw event sequence data itself, without relying on a labor-intensive feature extraction work. Our system outperforms baseline models, such as gradient boosting, which rely on specific data elements and therefore suffer from many missing values in our dataset.
Background: Sepsis is a clinical condition involving an extreme inflammatory response to an infection, and is associated with high morbidity and mortality. Without intervention, this response can progress to septic shock, organ failure and death. Every hour that treatment is delayed mortality increases. Early identification of sepsis is therefore important for a positive outcome. Methods: We constructed predictive models for sepsis detection and performed a register-based cohort study on patients from four Danish municipalities. We used event-sequences of raw electronic health record (EHR) data from 2013 to 2017, where each event consists of three elements: a timestamp, an event category (e.g. medication code), and a value. In total, we consider 25.622 positive (SIRS criteria) sequences and 25.622 negative sequences with a total of 112 million events distributed across 64 different hospital units. The number of potential predictor variables in raw EHR data easily exceeds 10.000 and can be challenging for predictive modeling due to this large volume of sparse, heterogeneous events. Traditional approaches have dealt with this complexity by curating a limited number of variables of importance; a labor-intensive process that may discard a vast majority of information. In contrast, we consider a deep learning system constructed as a combination of a convolutional neural network (CNN) and long short-term memory (LSTM) network. Importantly, our system learns representations of the key factors and interactions from the raw event sequence data itself. Results: Our model predicts sepsis with an AUROC score of 0.8678, at 11 hours before actual treatment was started, outperforming all currently deployed approaches. At other prediction times, the model yields following AUROC scores. 15 min: 0.9058, 3 hours: 0.8803, 24 hours: 0.8073. Conclusion: We have presented a novel approach for early detection of sepsis that has more true positives and fewer false negatives than existing alarm systems without introducing domain knowledge into the model. Importantly, the model does not require changes in the daily workflow of healthcare professionals at hospitals, as the model is based on data that is routinely captured in the EHR. This also enables real-time prediction, as healthcare professionals enters the raw events in the EHR.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.