Contextual Embeddings from Clinical Notes Improves Prediction of Sepsis

Amrollahi, Fatemeh; Shashikumar, Supreeth P.; Razmi, Fereshteh; Nemati, Shamim

doi:10.1101/2021.03.02.21252779

Cited by 8 publications

(24 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Of the 9 identified articles, 2 studies aimed at identifying infection, 47 , 48 6 studies focused on early detection of sepsis, 51 , 53 , 55 severe sepsis, 49 or septic shock, 50 , 54 and 1 study considered both identification and early detection for a combination of sepsis, severe sepsis, and septic shock. 52 Most studies focused on intensive care unit (ICU) 48 , 50 , 52–55 or emergency department (ED) 47 , 51 data; only 1 used inpatient care data.…”

Section: Resultsmentioning

confidence: 99%

“…49 Four studies utilized data from hospitals, 47 , 49 , 51 , 52 1 utilized MIMIC-II 54 and 4 utilized MIMIC-III. 48 , 50 , 53 , 55 MIMIC-II and MIMIC-III are publicly available ICU datasets created from Boston’s Beth Israel Deaconess Medical Center; MIMIC-II contains data from 2001–2007 76 and MIMIC-III contains data from 2001–2012. 77 Eight studies used data from the United States 47–51 , 53–55 and 1 study used data from Singapore.…”

Section: Resultsmentioning

confidence: 99%

“… 48 , 50 , 53 , 55 MIMIC-II and MIMIC-III are publicly available ICU datasets created from Boston’s Beth Israel Deaconess Medical Center; MIMIC-II contains data from 2001–2007 76 and MIMIC-III contains data from 2001–2012. 77 Eight studies used data from the United States 47–51 , 53–55 and 1 study used data from Singapore. 52 Sample sizes varied greatly in terms of the number of patients or notes used.…”

Section: Resultsmentioning

confidence: 99%

“… 52 Sample sizes varied greatly in terms of the number of patients or notes used. To select patient cohorts or notes associated with sepsis, 3 studies used International Statistical Classification of Diseases and Related Health Problems (ICD) codes, 47 , 49 , 52 5 applied sepsis definition criteria, 49–51 , 53 , 55 1 utilized descriptions of antibiotics usage, 48 and another 54 applied criteria from Henry et al 78 that include ICD codes, sepsis criteria, and notes mentioning sepsis or septic shock. Table 1 summarizes the study characteristics and additional details are in Supplementary Table S2 (for Culliton et al, 49 the 8 structured variables for the Modified Baystate clinical definition of severe sepsis and 29 structured variables used in models were provided through personal communications with the corresponding author of Culliton et al, 49 Steve Gallant, on June 4, 2021).…”

Section: Resultsmentioning

confidence: 99%

“…Hence, a search on May 15, 2021, including those terms, found 2 additional articles. 53 , 54 To ensure inclusion of other relevant articles, a broader search was conducted on September 3, 2021 to include the following terms: “unstructured data,” “unstructured text,” “clinical note,” “clinical notes,” “clinical text,” “free-text,” “free text,” “record text,” “narrative,” or “narratives.” This resulted in 1 additional article. 55…”

Section: Methodsmentioning

confidence: 99%

See 4 more Smart Citations

Sepsis prediction, early detection, and identification using clinical text for machine learning: a systematic review

Yan

Gustad

Nytrø

2021

Journal of the American Medical Informatics Association

View full text Add to dashboard Cite

Objective To determine the effects of using unstructured clinical text in machine learning (ML) for prediction, early detection, and identification of sepsis. Materials and methods PubMed, Scopus, ACM DL, dblp, and IEEE Xplore databases were searched. Articles utilizing clinical text for ML or natural language processing (NLP) to detect, identify, recognize, diagnose, or predict the onset, development, progress, or prognosis of systemic inflammatory response syndrome, sepsis, severe sepsis, or septic shock were included. Sepsis definition, dataset, types of data, ML models, NLP techniques, and evaluation metrics were extracted. Results The clinical text used in models include narrative notes written by nurses, physicians, and specialists in varying situations. This is often combined with common structured data such as demographics, vital signs, laboratory data, and medications. Area under the receiver operating characteristic curve (AUC) comparison of ML methods showed that utilizing both text and structured data predicts sepsis earlier and more accurately than structured data alone. No meta-analysis was performed because of incomparable measurements among the 9 included studies. Discussion Studies focused on sepsis identification or early detection before onset; no studies used patient histories beyond the current episode of care to predict sepsis. Sepsis definition affects reporting methods, outcomes, and results. Many methods rely on continuous vital sign measurements in intensive care, making them not easily transferable to general ward units. Conclusions Approaches were heterogeneous, but studies showed that utilizing both unstructured text and structured data in ML can improve identification and early detection of sepsis.

show abstract