Development of a Clinical Reasoning Documentation Assessment Tool for Resident and Fellow Admission Notes: a Shared Mental Model for Feedback

Schaye, Verity; Miller, Louis H.; Kudlowitz, David; Chun, Jonathan W.; Burk-Rafel, Jesse; Cocks, Patrick; Guzman, Benedict; Aphinyanaphongs, Yindalon; Marin, Marina

doi:10.1007/s11606-021-06805-6

Cited by 17 publications

(10 citation statements)

References 29 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

Section: Supplemental Contentmentioning

confidence: 99%

“…Primary outcome was the Revised-IDEA (R-IDEA) score, a validated 10-point scale evaluating 4 core domains of clinical reasoning documentation (eTable 3 in Supplement 1). 6 To establish reliability, we (D.R., Z.K., A.R.) independently scored 29 section responses from 8 nonparticipants, showing substantial scoring agreement (mean Cohen weighted κ = 0.61).…”

mentioning

confidence: 99%

“…Physicians' vignette responses may differ from clinical practice; however, physician R-IDEA scores in clinical documentation were lower than physician scores in this study. 6 We also used a zero-shot approach for chatbot's prompt. Iterative training could enhance LLM performance, suggesting that the results may have underestimated its capabilities.…”

mentioning

confidence: 99%

See 2 more Smart Citations

Clinical Reasoning of a Generative Artificial Intelligence Model Compared With Physicians

Cabral,

Restrepo,

Kanjee

et al. 2024

JAMA Intern Med

View full text Add to dashboard Cite

show abstract

Section: Supplemental Contentmentioning

confidence: 99%

mentioning

confidence: 99%

See 1 more Smart Citation

Clinical Reasoning of a Generative Artificial Intelligence Model Compared With Physicians

Cabral,

Restrepo,

Kanjee

et al. 2024

JAMA Intern Med

View full text Add to dashboard Cite

show abstract

“…23 For instance, models like Schaye et al's ML model for automated assessment of resident clinical reasoning documentation are examples of supervised ML that use text-based labeled datasets. 24 Such models help overcome traditional barriers in medical education assessment by providing a sufficient number of assessment inputs and consistency in standards of assessment. 25 This automated assessment of clinical reasoning documentation helps overcome the barriers in medical education assessment of obtaining sufficient number of assessment inputs and consistency in standards of assessment.…”

Section: Proactive Data Collectionmentioning

confidence: 99%

“…AI is increasingly being used in the assessment of physician competence across various levels of learners (undergraduate medical education [UME], graduate medical education [GME], and continuing medical education [CME]), competency domains (e.g., medical knowledge and patient care), and different types of data input (e.g., text vs video) and AI technologies (e.g., supervised vs unsupervised ML) 23 . For instance, models like Schaye et al’s ML model for automated assessment of resident clinical reasoning documentation are examples of supervised ML that use text-based labeled datasets 24 . Such models help overcome traditional barriers in medical education assessment by providing a sufficient number of assessment inputs and consistency in standards of assessment 25 .…”

Section: Use Of Ai In Precision Educationmentioning

confidence: 99%

Demystifying AI: Current State and Future Role in Medical Education Assessment

Turner,

Hashimoto,

Vasisht

et al. 2023

Acad Med

View full text Add to dashboard Cite

Medical education assessment faces multifaceted challenges, including data complexity, resource constraints, bias, feedback translation, and educational continuity. Traditional approaches often fail to adequately address these issues, creating stressful and inequitable learning environments. This article introduces the concept of precision education, a data-driven paradigm aimed at personalizing the educational experience for each learner. It explores how artificial intelligence (AI), including its subsets machine learning (ML) and deep learning (DL), can augment this model to tackle the inherent limitations of traditional assessment methods. AI can enable proactive data collection, offering consistent and objective assessments while reducing resource burdens. It has the potential to revolutionize not only competency assessment but also participatory interventions, such as personalized coaching and predictive analytics for at-risk trainees. The article also discusses key challenges and ethical considerations in integrating AI into medical education, such as algorithmic transparency, data privacy, and the potential for bias propagation. AI’s capacity to process large datasets and identify patterns allows for a more nuanced, individualized approach to medical education. It offers promising avenues not only to improve the efficiency of educational assessments but also to make them more equitable. However, the ethical and technical challenges must be diligently addressed. The article concludes that embracing AI in medical education assessment is a strategic move toward creating a more personalized, effective, and fair educational landscape. This necessitates collaborative, multidisciplinary research and ethical vigilance to ensure that the technology serves educational goals while upholding social justice and ethical integrity.

show abstract

Large Language Model Influence on Diagnostic Reasoning

Goh,

Gallo,

Hom

et al. 2024

JAMA Netw Open

View full text Add to dashboard Cite

ImportanceLarge language models (LLMs) have shown promise in their performance on both multiple-choice and open-ended medical reasoning examinations, but it remains unknown whether the use of such tools improves physician diagnostic reasoning.ObjectiveTo assess the effect of an LLM on physicians’ diagnostic reasoning compared with conventional resources.Design, Setting, and ParticipantsA single-blind randomized clinical trial was conducted from November 29 to December 29, 2023. Using remote video conferencing and in-person participation across multiple academic medical institutions, physicians with training in family medicine, internal medicine, or emergency medicine were recruited.InterventionParticipants were randomized to either access the LLM in addition to conventional diagnostic resources or conventional resources only, stratified by career stage. Participants were allocated 60 minutes to review up to 6 clinical vignettes.Main Outcomes and MeasuresThe primary outcome was performance on a standardized rubric of diagnostic performance based on differential diagnosis accuracy, appropriateness of supporting and opposing factors, and next diagnostic evaluation steps, validated and graded via blinded expert consensus. Secondary outcomes included time spent per case (in seconds) and final diagnosis accuracy. All analyses followed the intention-to-treat principle. A secondary exploratory analysis evaluated the standalone performance of the LLM by comparing the primary outcomes between the LLM alone group and the conventional resource group.ResultsFifty physicians (26 attendings, 24 residents; median years in practice, 3 [IQR, 2-8]) participated virtually as well as at 1 in-person site. The median diagnostic reasoning score per case was 76% (IQR, 66%-87%) for the LLM group and 74% (IQR, 63%-84%) for the conventional resources-only group, with an adjusted difference of 2 percentage points (95% CI, −4 to 8 percentage points; P = .60). The median time spent per case for the LLM group was 519 (IQR, 371-668) seconds, compared with 565 (IQR, 456-788) seconds for the conventional resources group, with a time difference of −82 (95% CI, −195 to 31; P = .20) seconds. The LLM alone scored 16 percentage points (95% CI, 2-30 percentage points; P = .03) higher than the conventional resources group.Conclusions and RelevanceIn this trial, the availability of an LLM to physicians as a diagnostic aid did not significantly improve clinical reasoning compared with conventional resources. The LLM alone demonstrated higher performance than both physician groups, indicating the need for technology and workforce development to realize the potential of physician-artificial intelligence collaboration in clinical practice.Trial RegistrationClinicalTrials.gov Identifier: NCT06157944

show abstract

Development of a Clinical Reasoning Documentation Assessment Tool for Resident and Fellow Admission Notes: a Shared Mental Model for Feedback

Cited by 17 publications

References 29 publications

Clinical Reasoning of a Generative Artificial Intelligence Model Compared With Physicians

Clinical Reasoning of a Generative Artificial Intelligence Model Compared With Physicians

Demystifying AI: Current State and Future Role in Medical Education Assessment

Large Language Model Influence on Diagnostic Reasoning

Contact Info

Product

Resources

About