Diagnostic Errors in Hospitalized Adults Who Died or Were Transferred to Intensive Care

Auerbach, Andrew D.; Lee, Tiffany M.; Hubbard, Colin C.; Ranji, Sumant R.; Raffel, Katie; Valdes, Gilmer; Boscardin, John; Dalal, Anuj K.; Harris, Alyssa; Flynn, Ellen; Schnipper, Jeffrey L.; ,; Feinbloom, David; Roy, Bethany N.; Herzig, Shoshana J.; Wazir, Mohammed; Gershanik, Esteban F.; Goyal, Abhishek; Chitneni, Pooja R.; Burney, Sharran; Galinsky, Janice; Rastegar, Sarah; Moore, Danielle; Berdahl, Carl; Seferian, Edward G.; Suri, Krithika; Ramishvili, Tea; Vedamurthy, Deepak; Hunt, Daniel P.; Mehta, Amisha S.; Katakam, Haritha; Field, Stephanie A.; Karatasakis, Barbara; Beeler, Katharina; Himmel, Allison M.; Eid, Shaker; Gandhi, Sonal; Pena, Ivonne M.; Ranta, Zachary S.; Lipten, Samuel D.; Lucier, David J.; Walker-Corkery, Beth; Kleinman Sween, Jennifer; Kirchoff, Robert W.; Rieck, Katie M.; Kolar, Gururaj J.; Parikh, Riddhi S.; Burton, Caroline; Dugani, Chandrasagar; Dapaah-Afriyie, Kwame; Finn, Arkadiy; Raju, Sushma B.; Surani, Asif; Segon, Ankur; Bhandari, Sanjay; Astik, Gopi J.; O’Leary, Kevin J.; Helminski, A. Shams; Anstey, James; Zhou, Mengyu; Alday, Angela E.; Halvorson, Stephanie A.C.; Esmaili, Armond M.; Barish, Peter; Fenton, Cynthia; Kantor, Molly; Choi, Kwang Jin; Schram, Andrew W.; Ruhnke, Gregory; Patel, Hemali; Virapongse, Anunta; Burden, Marisha; Ngov, Li-Kheng; Keniston, Angela; Talari, Preetham; Romond, John B.; Vick, Sarah E.; Williams, Mark V.; Marr, Ruby A.; Gupta, Ashwin B.; Rohde, Jeffrey M.; Mao, Frances; Fang, Michele M.; Greysen, S. Ryan; Shah, Pranav; Kim, Christopher S.; Narayanan, Maya; Wolpaw, Benjamin; Ellingson, Sonja M.; Kaiksow, Farah A.; Kenik, Jordan S.; Sterken, David; Lewis, Michelle E.; Manwani, Bhavish R.; Ledford, Russell W.; Webber, Chase J.; Vasilevskis, Eduard E.; Buckley, Ryan J.; Kripalani, Sunil B.; Sankey, Christopher; Ostfeld-Johns, Sharon R.; Gielissen, Katherine; Wijesekera, Thilan; Jordan, Eric; Karwa, Abhishek; Churnet, Bethlehem; Chia, David; Brooks, Katherine

doi:10.1001/jamainternmed.2023.7347

Cited by 26 publications

(3 citation statements)

References 45 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…15 Envision an integration of this generated ranked list of diagnoses that would exist in line with clinicians' current workflow, appearing as an optional, editable block of text while writing the "assessment and plan" section of a note. This could hint to a hospitalist of late-onset medication adverse effect, such as those from immunotherapy given months prior or add consideration to salicylate toxicity in a patient presenting with gastrointestinal bleed from nonsteroidal antiinflammatory overuse, a recently highlighted diagnostic miss in Auerbach et al 16 It is our opinion that this collaborative approach between human clinicians and AI will be synergistic and will improve diagnostic accuracy and enhance patient care overall. AI will undoubtedly change how clinicians practice as it is incorporated into healthcare.…”

Section: Differential Diagnosis Supportmentioning

confidence: 99%

Diagnostic reasoning in the age of artificial intelligence: Synergy or opposition?

Gleber,

Fear

2024

Journal of Hospital Medicine

View full text Add to dashboard Cite

Section: Differential Diagnosis Supportmentioning

confidence: 99%

Diagnostic reasoning in the age of artificial intelligence: Synergy or opposition?

Gleber,

Fear

2024

Journal of Hospital Medicine

View full text Add to dashboard Cite

Section: Introductionmentioning

confidence: 99%

“…Medical diagnosis is a high-stakes cognitive process that takes place in time-constrained and stressful clinical environments. Diagnostic errors are common and contribute to significant patient harm 1,2,3,4,5,6 . Strategies to reduce diagnostic errors include a variety of educational, reflective, and team-based practices.…”

Section: Introductionmentioning

confidence: 99%

Influence of a Large Language Model on Diagnostic Reasoning: A Randomized Clinical Vignette Study

Goh,

Gallo,

Hom

et al. 2024

Preprint

View full text Add to dashboard Cite

Importance: Diagnostic errors are common and cause significant morbidity. Large language models (LLMs) have shown promise in their performance on both multiple-choice and open-ended medical reasoning examinations, but it remains unknown whether the use of such tools improves physician performance. Objective: To assess the impact of the GPT-4 LLM on physicians diagnostic reasoning compared to conventional resources. Design: Multi-center, randomized clinical vignette study. Setting: The study was conducted using remote video conferencing with physicians across the country and in-person participation across multiple academic medical institutions. Participants: Resident and attending physicians with training in family medicine, internal medicine, or emergency medicine. Intervention(s): Participants were randomized to access GPT-4 in addition to conventional diagnostic resources or to just conventional resources. They were allocated 60 minutes to review up to six clinical vignettes adapted from established diagnostic reasoning exams. Main Outcome(s) and Measure(s): The primary outcome was diagnostic performance based on differential diagnosis accuracy, appropriateness of supporting and opposing factors, and next diagnostic evaluation steps. Secondary outcomes included time spent per case. Results: 50 physicians (26 attendings, 24 residents) participated, with an average of 5.2 cases completed per participant. The median diagnostic reasoning score per case was 76.3 percent (IQR 65.8 to 86.8) for the GPT-4 group and 73.7 percent (IQR 63.2 to 84.2) for the conventional resources group, with an adjusted difference of 1.6 percentage points (95% CI -4.4 to 7.6; p=0.60). The median time spent on cases for the GPT-4 group was 519 seconds (IQR 371 to 668 seconds), compared to 565 seconds (IQR 456 to 788 seconds) for the conventional resources group, with a time difference of -82 seconds (95% CI -195 to 31; p=0.20). GPT-4 alone scored 15.5 percentage points (95% CI 1.5 to 29, p=0.03) higher than the conventional resources group. Conclusions and Relevance: In a clinical vignette-based study, the availability of GPT-4 to physicians as a diagnostic aid did not significantly improve clinical reasoning compared to conventional resources, although it may improve components of clinical reasoning such as final diagnosis accuracy. GPT-4 alone demonstrated higher performance than both physician groups, suggesting opportunities for further improvement in physician-AI collaboration in clinical practice.

show abstract

Protecting Patients by Reducing Diagnostic Error

Zhang,

Gross

2024

JAMA Intern Med

View full text Add to dashboard Cite

Almost a decade after the release of "Improving Diagnosis in Health Care," the National Academies of Sciences, Engineering, and Medicine report that highlighted the imperative to improve the diagnostic process in health care, diagnostic errors continue to be a cause of patient harm and death. 1 In JAMA Internal Medicine, Auerbach et al 2 investigated diagnostic errors among hospitalized patients who had experienced a major clinical deterioration, defined as either death or requiring an intensive care unit transfer. Their study of 2428 patient records across 29 academic medical centers in the US found that 23% had experienced a diagnostic error, with 17% of patients experiencing harm or death as a result.While these findings are striking, it is important to highlight that this was a selected sample of the sickest patients in the hospital. Some of these patients may have had poor outcomes regardless of the errors. Subsequent research could incorporate comparison groups of patients who were admitted with similar diagnoses and clinical severity and assess the association between diagnostic error and subsequent outcome.

show abstract

Diagnostic Errors in Hospitalized Adults Who Died or Were Transferred to Intensive Care

Cited by 26 publications

References 45 publications

Diagnostic reasoning in the age of artificial intelligence: Synergy or opposition?

Diagnostic reasoning in the age of artificial intelligence: Synergy or opposition?

Influence of a Large Language Model on Diagnostic Reasoning: A Randomized Clinical Vignette Study

Protecting Patients by Reducing Diagnostic Error

Contact Info

Product

Resources

About