2021
DOI: 10.1038/s41598-021-02481-y
|View full text |Cite
|
Sign up to set email alerts
|

Using explainable machine learning to characterise data drift and detect emergent health risks for emergency department admissions during COVID-19

Abstract: A key task of emergency departments is to promptly identify patients who require hospital admission. Early identification ensures patient safety and aids organisational planning. Supervised machine learning algorithms can use data describing historical episodes to make ahead-of-time predictions of clinical outcomes. Despite this, clinical settings are dynamic environments and the underlying data distributions characterising episodes can change with time (data drift), and so can the relationship between episode… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
31
1

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 55 publications
(41 citation statements)
references
References 22 publications
0
31
1
Order By: Relevance
“…However, when the clinical environment is highly dynamic and patient populations are heterogeneous, a model that works well in one-time period or one hospital may fail in another. A recent example is the emergence of COVID-19 24 documented a performance drop in an ML algorithm for determining which patients were at high risk of hospital admission based on their emergency department (ED) presentation that relied on input variables like respiratory rate and arrival mode, which were significantly affected by the spread of COVID-19.…”
Section: Error In Clinical Ai Algorithmsmentioning
confidence: 99%
See 1 more Smart Citation
“…However, when the clinical environment is highly dynamic and patient populations are heterogeneous, a model that works well in one-time period or one hospital may fail in another. A recent example is the emergence of COVID-19 24 documented a performance drop in an ML algorithm for determining which patients were at high risk of hospital admission based on their emergency department (ED) presentation that relied on input variables like respiratory rate and arrival mode, which were significantly affected by the spread of COVID-19.…”
Section: Error In Clinical Ai Algorithmsmentioning
confidence: 99%
“…Given the complexity of ML algorithms, a number of papers have suggested monitoring ML explainability metrics, such as variable importance (VI) 18 , 24 . The idea is that these metrics provide a more interpretable representation of the data.…”
Section: Monitoring Clinical Ai Algorithmsmentioning
confidence: 99%
“…For XGBoost we tried to tune the number of estimators, (11,13,15,17,19), their depth (7,8,9,10,11,12,13) and learning rate (0.01, 0.1, 0.2, 0.3). Configuring neural networks is more difficult and sometimes computationally heavy, especially for the TabNet model.…”
Section: Model Development and Trainingmentioning
confidence: 99%
“…Singh and colleagues [11] demonstrate the impact of distribution shifts across 179 US facilities on a sepsis prediction model. Duckworth and colleagues [12] show distribution shifts during the COVID-19 pandemic and evaluate their impact on a hospital admission prediction model.…”
Section: Introductionmentioning
confidence: 99%
“…It highlights timed-based drift in the data which influences the choice of threshold to be suboptimal. 40 Based on the AUC, AP, and Brier Score, random forest classifier was chosen for the Jakarta model due to well-calibrated predictions and high training and testing performance. SHAP analysis is then executed using random forest as the model.…”
Section: Jakarta Modelmentioning
confidence: 99%